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Abstract 



This thesis concerns the theoretical foundations of persistence-based topological data 
analysis. The primary focus of the work is on the development of theory of topolog- 
ical inference in the multidimensional persistence setting, where the set of available 
theoretical and algorithmic tools has remained comparatively underdeveloped, rela- 
tive to the 1-D persistence setting. The thesis establishes a number of theoretical 
results centered around this theme, some of which are new and interesting even for 
1-D persistent homology. In addition, this work presents theory of topological infer- 
ence formulated directly on the (topological) level of nitrations rather than on the 
(algebraic) level of persistent homology modules. 

The main mathematical objects of study in this work are interleavings. These 
are tools for quantifying the similarity between multidimensional nitrations and per- 
sistence modules. They were introduced for 1-D nitrations and persistence modules 
by Chazal et al. [8], where they were used to prove a strong and very useful gener- 
alization of the stability of persistence result of [H]; we generalize the definition of 
interleavings appearing in [8] in several directions and use these generalizations to de- 
fine pseudometrics on multidimensional nitrations and multidimensional persistence 
modules called interleaving distances. 

The first part of this thesis, adapted from the preprint [32], studies in detail the 
theory of interleavings and interleaving distances on multidimensional persistence 
modules. We present six main results about the interleaving distance. 

First, we show that in the case of 1-D persistence, the interleaving distance is 
equal to the bottleneck distance on tame persistence modules. 

Second, we prove a theorem which implies that the restriction of the interleaving 
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distance to finitely presented multidimensional persistence modules is a metric. The 
same theorem, together with our first result, also yields a converse to the algebraic 
stability theorem of [8]; this answers a question posed in that paper. 

Third, we present an "extrinsic" characterization of interleaved pairs of multidi- 
mensional persistence modules which makes transparent the sense in which interleaved 
modules are algebraically similar. This characterization turns out to hold for a def- 
inition of interleavings of multidimensional persistence modules rather more general 
than that which we need to define the interleaving distance; the more general form 
of our result is an important ingredient in inferential theory we develop in the second 
part of this thesis. 

Fourth, we observe that the interleaving distance is stable in four senses analogous 
to those in which the bottleneck distance is known to be stable. 

Fifth, we introduce several notions of optimality of metrics on persistence modules 
and show that when the underlying field is Q or a field of prime order, the interleaving 
distance is optimal with respect to one of these notions. This optimality result, which 
is new even for 1-D persistence, is the central result of the first part of this thesis. We 
also prove that a version of this result holds for ordinary persistence modules over any 
field, provided we restrict attention to a class of well behaved ordinary persistence 
modules containing the finitely presented ones. 

Sixth, we show that the computation of the interleaving distance between two 
finitely presented multidimensional persistence modules M and N reduces to deciding 
the solvability of O(logm) systems of multivariate quadratic equations, each with 
0(m 2 ) variables and 0(m 2 ) equations, where m is the total number of generators and 
relations in a minimal presentation for M and a minimal presentation for N. 

In the second part of the thesis, we define interleavings and interleaving distances 
on multidimensional nitrations, and present theoretical results for these. We then use 
interleavings and interleaving distances on multidimensional nitrations to formulate 
and prove several analogues of a topological inference theorem of [11] in the multi- 
dimensional setting, and directly on the level of nitrations. In particular, we employ 
localization of categories, a standard construction in homotopy theory, to define and 
study homotopy theoretic versions of interleavings and the interleaving distance on 
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multidimensional nitrations, which we call weak interleavings and the weak inter- 
leaving distance. We formulate our main inference results using weak interleavings 
and the weak interleaving distance. 

To describe these results in detail, let 7 : R m — > R be a probability density 
function on R m , and for z G Z> let T z be an i.i.d. sample of size z of a probability 
distribution with density 7. Let F z Ce be the random Cech bifiltration with vertices 
T z , filtered by the superlevelsets of E(T Z ), where E is a density estimator, and by the 
usual scale parameter for Cech complexes. 

Our first main inference result is that under mild conditions on 7 and E, Ff Ce 
converges in probability (with respect to the weak interleaving distance, and as z — > 
00) to a bifiltration constructed directly from 7, which we call the superlevel-offset 
bifiltration. 

Our second main inference result is an analogue of the first result for Vietoris-Rips 
bifiltrations, filtered by the superlevelsets of E(T Z ) and by the usual scale parameter 
for Vietoris-Rips complexes. 

We also present analogues for each of these results for probability density functions 
defined on Riemannian manifolds. 

These inference results on the level of nitrations yield as corollaries analogous 
results on the level of persistent homology, formulated in terms of the interleaving 
distance and interleavings on persistence modules. Our extrinsic characterization 
of interleavings from the first part of our thesis yields concrete interpretations of 
these corollaries as statements about the similarity between presentations of persistent 
homology modules. 
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Chapter 1 
Introduction 



This thesis concerns the theoretical foundations of persistence-based topological infer- 
ence. It focuses in particular on topological inference in the multidimensional setting 
and at the level of filtrations. 

In this introduction, we offer some context and motivation for the theory developed 
in this thesis and present an overview of our results. The chapter is divided into 
six sections: Section 11.11 introduces the broader context for the mathematics of this 
thesis and discusses the need for further development of the statistical foundations of 
topological data analysis; Section [TT21 introduces the problem of developing inferential 
theory for multidimensional filtrations, and presents background and motivation for 
the problem; Sections 11.3111. 51 present detailed overviews of the results of chapters 
2-4 of this thesis; and Section 11.61 closes this introduction with discussion of our 
motivation for considering in this thesis persistence based inference directly at the 
level of filtrations. 

This introduction is not intended as a tutorial; we will assume that the reader has 
some familiarity with some of the basic terminology and ideas of applied topology 
and topological data analysis. See the reviews [221 EH SI and the textbook [23] for 
treatment of the basics, and Sections 12.11 13.11 and 14.11 for foundational definitions. 
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1.1 On The Need for Firm Statistical Foundations 
of Topological Data Analysis 

To explain the mathematical context of this thesis, it will be useful for us to begin 
by formulating definitions of topological inference and topological data analysis. 

Recall first that in statistics, we distinguish between descriptive statistics and sta- 
tistical inference. Descriptive statistics, as the name suggests, is that part of statistics 
concerned with defining and studying descriptors of data. It involves no probability 
theory and aims simply to offer tools for describing, summarizing, and visualizing 
data. Statistical inference, on the other hand, concerns the more sophisticated enter- 
prise of estimating descriptors of an unknown probability distribution from random 
samples of the distribution. The theory and methods of statistical inference are built 
on the tools of descriptive statistics: The estimators considered in statistical inference 
are of course, when stripped of their inferential interpretation, merely descriptors of 
data. 

We define descriptive topological data analysis (descriptive TDA) to be 

the branch of descriptive statistics which uses topology to define and study qualitative 
descriptors of data sets. 

We define topological inference to be the branch of statistical inference which 

1. uses topology to define qualitative descriptors of probability distributions. 

2. develops and studies estimators for inferring such descriptors from finite samples 
of the distributions. 

We define topological data analysis (TDA) to refer collectively to descriptive 
TDA, topological inference, and the applications of these to science and engineering. 

In the last 10 years the TDA community has introduced a number of novel tools 
for descriptive TDA jlHl [7J [5J HTJ [161 1]) an d has begun developing applications of 
these in science and engineering. There is great interest in applying these tools to 
the study of random data, and a good deal of work has already been done in this 
direction E2J EH EE] • 

In the last few years, there also has been some important progress in topological 



CHAPTER 1. INTRODUCTION 



3 



inference [351 EJ [10] • However, the development of tools for descriptive TDA has, 
as a rule, outpaced the development of the theory of topological inference supporting 
the use of these tools in the study of random data. Indeed, statistical foundations 
for many of the most discussed tools in TDA — for example, Mapper [H], circle val- 
ued coordinatization [IB] , and persistent homology of Vietoris-Rips (multi-)filtraitons 
[7] — have either not been laid out at all, or have only partially (and recently) been laid 
out. In many cases, there is no theoretical framework in place for interpreting these 
descriptors of data as appropriately behaved estimators of descriptors of probability 
distributions. (An important exception is the work of Chazal et al. [11] on estimating 
the superlevelset persistent homology of density functions using Rips complexes; we'll 
discuss this work in detail in Section [1.2.21 ) 

Additionally there is, to date, little to no discussion in the TDA literature on 
how to compute theoretically sound measures of confidence for any estimator of a 
topological descriptor. This last gap is an especially critical one in the statistical 
foundations of TDA: It is a basic principle of statistical inference that an estimate 
of some descriptor of a probability distribution is only meaningful insofar as we also 
have a good measure of confidence for that estimate. 

In these senses, the statistical foundations of TDA remain quite underdeveloped; 
much further progress on the theory of topological inference is needed before the tools 
of TDA can sit comfortably amongst the more conventional tools in a statistician's 
toolbox. 

Carrying out this work is central to the program of fully realizing TDA as a 
data analysis methodology of value in science and engineering. Those of us working 
in TDA agree that, broadly speaking, TDA offers a very natural and powerful set 
of tools for qualitative statistical inference, even if the mathematical foundations of 
these tools are still in development. We share a common goal of seeing the machinery 
of TDA mature to the point that it can make an impact on science and engineering 
commensurate with what we believe its potential to be. For that to happen, a great 
deal of work needs to be done in a number of directions, but one of the most direct 
ways we as researchers in TDA can hasten the integration of TDA into the modern 
data analysis pipeline is by properly fleshing out the statistical foundations of TDA. 
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After all, if the statistical foundations of topological data analysis were firmer, the 
output of the tools of topological data analysis on random input would be more 
meaningful and more useful, and presumably statisticians, scientists, and engineers 
would be more receptive to the tools and more inclined to invest the time to develop 
new applications of them. 

This perspective motivates the work of this thesis. Indeed, this work is an effort 
to contribute to the program of fleshing out the statistical foundations of TDA. The 
program is a broad one, however, and the results of this thesis amount only to a 
narrow slice of what is needed to really put TDA on firm statistical footing. Never- 
theless, it is my hope that the results presented here can offer clarity on some basic 
issues in topological inference and TDA and open the door for further progress in the 
development of the theory. 

1.2 Overview: Inferential Theory for Multidimen- 
sional Filtrations 

We focus here on the theoretical foundations of persistence-based topological data anal- 
ysis, that central (though not all-encompassing) branch of topological data analysis 
that considers topological descriptors and random variables defined using filtrations 
and persistent homology. Our aims in particular are, first, to develop an inferential 
theory for multidimensional persistence, and, second, to develop inferential theory at 
the level of filtrations, rather than merely at the level of persistent homology modules. 

1.2.1 Context and Motivation for Our Inference Results 

The history of the problem of developing an inferential theory for multidimensional 
persistent homology dates back several years, to the original paper on multidimen- 
sional persistent homology [TJ. Motivated by needs arising in their study of natural 
scene statistics [6], in [7J Carlsson and Zomorodian proposed the use of Vietoris-Rips 
bifiltrations to probe the qualitative structure of point cloud data of nonuniform den- 
sity in exploratory data analysis applications. This proposal was motivated by the 
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idea that when the point cloud data set T z is obtained as an i.i.d. sample of size 
z of some probability distribution with density function 7 : M m — > M., the random 
Vietoris-Rips bifiltration with vertices T z , filtered by superlevelsets of a density esti- 
mator E and by the usual scale parameter for a Vietoris-Rips complex, should encode 
topological information about 7. 

A central aim of this thesis is to put this idea on firm mathematical footing. 

Our approach builds in an essential way on recent work on persistence-based 
topological inference by Chazal, Guibas, Oudot, and Skraba [IT] . In that work, the 
authors build on theory developed in their earlier papers [8j [12] to prove a result 
[UJ Theorem 5.1] on the inference of the 1-D (superlevelset) persistent homology of 
a probability density function 7 from a pair of filtered Vietoris-Rips complexes built 
on i.i.d. samples of a probability distribution with density 7. The primary aim of the 
paper [11] is to leverage persistent homology to introduce a clustering algorithm with 
good theoretical properties; the exposition there is such that the inference result [TTj 
Theorem 5.1] plays a supporting role, serving as the engine for the development of 
theoretical guarantees on the clustering algorithm presented in that papeio Never- 
theless, the result is a very significant one in of itself: It proves (implicitly, at least), 
for the first time, the consistency of an estimator of the superlevelset persistent ho- 
mology of a density function on a Euclidean space. Moreover, because that estimator 
is defined using only a pair of Vietoris-Rips complexes, it is simple and quite com- 
putable. One disadvantage of the estimator introduced in [11], however, is that its 
construction depends on a choice of scale parameter; as noted in [IT] , a correct value 
for this parameter can be difficult to choose in practice. 

It is natural to ask if and how the topological inference result [UJ Theorem 5.1] 
might adapt to the multidimensional setting, and whether it might also adapt to 
yield inference results directly on the level of nitrations rather than on the level of 
persistent homology. In Chapter 4 of this thesis, we will show that such adaptations 
are indeed possible. In fact, we will see that the multidimensional setting has the 
advantage that in constructing estimators there analogous to that considered in [11] , 



1 Presumably, the authors make this expository choice because the inference result [HI Theo- 
rem 5.1] is, modulo some details, a reasonably easy corollary of a deterministic result that is the 
centerpiece of the earlier paper |12j . 
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we do not need to choose a scale parametero 

1.2.2 The Result of Chazal et al. on the Inference of Sub- 
levelset Persistent Homology Using Rips Complexes 

To understand the context of our inference results, it is important to understand the 
result [HI Theorem 5.1]. We now present an asymptotic corollary of [Til Theorem 
5.1] which will serve as our mathematical point of departure in our pursuit of our 
own inference results. 

To prepare for the result, we review some basics about density estimation. 

Density Estimation Preliminaries 

Let D(R m ) denote the set of probability density functions on M. m . Define a density 
estimator Bona M m to be a sequence of functions {E z : (W m ) z — > T>(M. m )} ze ^ such 
that for each zeN, the restriction of E z to any point in W 71 is a measurable function 
from (M m ) z to R. In formulating our results, we will consider pairs (^,E), where 7 
is a density function and E is a density estimator, for which one of the following two 
assumptions holds: 

Al: E converges uniformly in probability to 7. 

A2: E converges uniformly in probability to the convolution of 7 with some kernel 
function K. 

Al is known to hold for kernel density estimators, for a wide class of kernels and 
density functions 7, provided the kernel width tends to at an appropriate rate as 
z tends to infinity. A2 is known to hold for the kernel density estimator with kernel 
K (with fixed width as the number of samples z varies) for a wide class of kernels K 
and density functions 7. See Section f4. 1.31 for more details and references. 

2 For both the estimator considered in and the estimators we consider here, it is necessary 
to chose a bandwidth parameter for a density estimator. In the Euclidean case, the need to select 
a bandwidth parameter is not an unmanageable difficulty, at least in low dimensions: For density 
functions on Euclidean spaces, the problem of optimally selecting a bandwidth parameter for a kernel 
density estimator has been well studied — see [39] . 
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Our Asymptotic Corollary of the Inference Result |11L Theorem 5.1] 

The form of the result we quote here is different in several ways than that of [TTj 
Theorem 5.1]. First, as we noted above, rather than recall the original form of [TTj 
Theorem 5.1], we will present a tidier asymptotic corollary of it whose form is closer 
to that of our main inference results. The proof of the asymptotic corollary, given 
the original form of the result, follows from an e-8 argument very similar to that used 
in the proof of our Theorem 14.5.21 Second, because of the emphasis on clustering in 
[llj . the result [TTJ Theorem 5.1] is stated only for th persistent homology, but as the 
authors note, the result adapts immediately to higher persistent homology via the 
results of [12]. We will present here a version of the asymptotic corollary which holds 
for i th persistent homology, i G Z>o. Third, to minimize the technicalities in this 
introduction, we present the result for density functions defined on Euclidean space; 
the result adapts readily to Riemannian manifolds with sectional curvature bounded 
above and below. 

Let 7 : M. m — > K be a c-Lipchitz density function such that for % G Z>o, the 
superlevelset persistent homology module Hi(—j) is tamej; let T z be an i.i.d. random 
sample of a probability distribution with density 7 of size z; let d p be the restriction 
of some LP metric to T z , for 1 < p < 00; and let E be a density estimator. 

For 5 > 0, let F SR (T Z , d p , -E(T Z ), 5) be a Rips filtration on (T z ,d p ) with fixed 
scale parameter 5, filtered by sublevelsets of —E(T Z ). For i G Z> , let Hi denote the 
■th p er si s t en t homology functor. The inclusion 

F SR (T Z , d p , -E(T Z ), 6) ^ F SR (T Z , d p , -E(T Z ), 25) 

induces a homomorphism 

j z>5 : Hi(F SR (T z , cf, -E{T Z ), 5)) -> H % {F SR {T Z , d p , -E(T Z ), 25)) 

of persistence modules. lm(j z ^ z ) is then itself a persistence module. 



3 See Section [2.1.3l for the definition of a tame persistence module. The tameness condition ensures 
that the bottleneck distance in the statement of [11] Theorem 5.1] is well defined. 
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For a 1-D persistence module M, let M + denote the submodule of M generated 
by homogeneous summands M a of M with a > and let Rq(M) = M/M + . 

Let d,B denote the bottleneck distance on tame persistence modules [SJ. If {X z } z(3i 
is a sequence of random variables, and X is a random variable such that X z converges 
in probability to X, we write X; — > X. 

Theorem 1.2.1 (Inference result of [11] (asymptotic form)). 
(%) If {^,E) satisfies Al then 3 a sequence {8 z } z ^m such that 

(as z — » 00 / 

(^mJ // (7, -E) satisfies A2 for some kernel K then 3 a sequence {5 2 } 2 eN snc/i t/iat 

d fl (i2o(ImO«„,)), MHi{-l * K))) A 0. 

1.2.3 Distances and Other Notions of Proximity on Filtra- 
tions and Persistence Modules 

One of the main goals of this work is to adapt the consistency result Theorem 11.2.11 
to the multidimensional setting and to the level of nitrations. Since Theorem 11.2.11 is 
formulated in terms of the bottleneck distance, to carry out this adaptation we require 
analogues of this distance on multidimensional persistence modules and nitrations. 
The main obstacle to adapting Theorem 11.2.11 to the multidimensional setting is 
that the bottleneck distance of ordinary persistent homology does not admit a naive 
extension to the multidimensional setting. Similarly, the main obstacle to adapting 
the theorem to the level of nitrations is that the bottleneck distance does not admit 
a naive adaptation to a distance on nitrations, even in 1-D. 

Most of the effort of this thesis is put not directly towards proving inference results, 
but rather towards introducing and studying generalizations of the bottleneck distance 
to multidimensional nitrations and persistence modules which we need to formulate 
our inference results. Indeed, one of the main themes of this work is that developing 



CHAPTER 1. INTRODUCTION 



9 



the theory of topological inference largely boils down to the problem of selecting 
and understanding distances (or more general notions of proximity, which strictly 
speaking, are not distances in any reasonable mathematical sense) on appropriate 
objects: Once we develop the mathematical vocabulary needed to properly formulate 
our inference results in this thesis, the proofs of the results turn out to be reasonably 
straightforward, given existing ideas in the literature (and particularly those put forth 
[TT]). The main mathematical challenge then is to make sense of the distances and 
notions of proximity between nitrations and between persistence modules with which 
we formulate the inference results, and to show that these notions are, in suitable 
senses, the right ones to use. 

We formulate our inference results using interleavings and distances defined in 
terms of interleavings called interleaving distances. As noted in the abstract, inter- 
leavings are tools for quantifying the similarity between multidimensional nitrations 
and persistence modules. Interleaving distances are generalizations of the bottleneck 
distance to pseudometrics on multidimensional nitrations and on multidimensional 
persistence modules. The most basic type of interleavings, called e-interleavings, 
were introduced for 1-D persistence nitrations and persistence modules by Chazal 
et al. [Sj. This thesis introduces generalizations of the definition of e-interleavings 
given in [5] to definitions of e-interleavings on multidimensional nitrations and multi- 
dimensional persistence modules. We define interleaving distances in terms of these 
generalized e-interleavings. 

In fact, in this thesis we generalize the notion of e-interleavings yet further to 
arrive at the definition of (Ji, J^-interleavings, which we also sometimes call gen- 
eralized interleavings. We apply interleaving distances and ( J%, ^-interleavings 
to formulate results on multidimensional inference at the level of nitrations. (Ji, J2)- 
interleavings allow us to quantify anisotropic and asymmetric similarities between 
nitrations and between persistence modules that cannot be completely described us- 
ing interleaving distances. They turn out to be the right mathematical tools for 
interpreting random Vietoris-Rips bifiltrations as inferential objects. 
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1.3 Chapter 2: Interleavings and Interleaving Dis- 
tances for Multidimensional Persistence Mod- 
ules 

In Chapter 2 of this thesis, we introduce interleavings and the interleaving distance 
dj on multidimensional persistence modules, and treat the theory of these in detail. 
Chapter 2 of this thesis is adapted from the preprint [32] ; the content of Chapter 2 is 
very close to that of |32j. The most important difference is that we define interleavings 
in greater generality and extend our extrinsic characterization of e-interleavings in [32] 
to an extrinsic characterization of generalized interleavings. There are some other 
minor differences between Chapter 2 of the thesis and [52], but none of any great 
significance. 

1.3.1 Results 

In Chapter 2, we present six main results on interleavings and the interleaving dis- 
tance. The first result, Theorem I2.4.2[ shows that in the case of ordinary persistence, 
the interleaving distance is in fact equal to the bottleneck distance on tame persistence 
modules. Our proof relies on a generalization of the structure theorem [16] for finitely 
generated ordinary persistence modules to (discrete) tame persistence modules. This 
generalization is proven e.g. in [4l5] . 

Our second main result is Theorem 12.5.1} which tells us that if M and N are 
two finitely presented persistence modules and di(M,N) = e then M and N are e- 
interleaved. As an immediate consequence of this theorem, we have Corollary 12.5.21 
which says that the interleaving distance restricts to a metric on finitely presented 
persistence modules. Theorems 12.4.21 and 12.5.11 together also yield Corollary I2.5.3[ a 
converse to the algebraic stability theorem of |8j. The converse says, firstly, that if 
two tame 1-D persistence modules M and N are are distance e apart in the bottleneck 
distance, then they are e+5-interelaved for any 5 > 0. Secondly, the converse says that 
if in addition M and iV are each finitely presented (which is stronger than tameness), 
then M and N are in fact e-interleaved. This result answers a question posed in [8]. 
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Our third main result is Theorem I2.6.4[ an "extrinsic" characterization of (Ji, J2)- 
interleaved pairs of persistence modules; it expresses transparently the sense in which 
( Ji, ^-interleaved persistence modules are algebraically similar. Since e-interleavings 
are a special type of (Ji, ^-interleaving, and since the interleaving distance is defined 
in terms of e-interleavings, the result also yields an extrinsic characterization of the 
interleaving distance. The result is reminiscent of the extrinsic characterization of the 
Gromov-Hausdorff distance, which expresses the Gromov-Hausdorff distance between 
two compact metric spaces in terms of the Hausdorff distance between embeddings 
of two metric spaces into a third metric space. Roughly speaking, Theorem 12.6.41 
characterizes e-interleaved pairs of persistence modules in terms of a distance between 
embeddings of presentations of such modules into a free persistence module. 

As noted above, Theorem 12.6.41 was presented in [32J only for the special case of 
e-interleavings. The more general form of our result will be an important ingredient 
of the inferential theory we develop in the second part of this thesis. 

Our fourth result is the observation that the interleaving distance is stable in four 
senses analogous to those in which the bottleneck distance is known to be stable. 
These stability results, while notable, require very little mathematical work; two of 
the stability results turn out to be trivial, the third follows from a minor modification 
of an argument given in [9], and the fourth admits a straightforward proof. 

Our fifth main result, Corollary 12. 10. 2\ is an optimality result for the interleaving 
distance. It tells us that when the underlying field is Q or a field of prime order, 
the interleaving distance is stable in a sense analogous to that which the bottleneck 
distance is shown to be stable in [T?| [S], and further, that the interleaving distance 
is, in a uniform sense, the most sensitive of all stable pseudometrics. This "maxi- 
mum sensitivity" property of the interleaving distance is equivalent to the property 
that, with respect to the interleaving distance, multidimensional persistent homology 
preserves the metric on source objects as faithfully as is possible for any choice of 
stable pseudometric on multidimensional persistence modules; see Remark 12.9.31 for 
a precise statement. Our optimality result is new even for 1-D persistence. In that 
case, it offers some mathematical justification, complementary to that of [TH |Sj, for 
the use of the bottleneck distance. 
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In fact, provided we restrict attention to a class of well behaved ordinary persis- 
tence modules containing the finitely presented ones, the assumption that the under- 
lying field is Q or a field of prime order is unnecessary; our Theorem 12.10.81 gives an 
analogue of Corollary 12.10.21 for this class of modules, over arbitrary fields. 

The main ingredient in the proof of Corollary 12. 10.21 is our characterization result 
Theorem 12.6.41 Using Theorem 12. 6. 4} we present a constructive argument which 
shows that when the underlying field is Q or a field of prime order, e-interleavings 
can, in a suitable sense, be lifted to a category of M n -valued functions. This is 
Proposition 12.10.41 From this proposition, our optimality result follows readily. 

Given our first five main results, it is natural to ask if and how the interleaving 
distance can be computed. Our sixth main result speaks to this question. The result, 
which follows from Theorem 12. 1 1 .51 and Proposition 12. 11 S)\ is that the computation of 
the interleaving distance between two finitely presented multidimensional persistence 
modules M and iV reduces to deciding the solvability of O(logm) systems of multi- 
variate quadratic equations, each with 0(m 2 ) variables and 0(m 2 ) equations, where 
m is the total number of generators and relations in a minimal presentation for M and 
a minimal presentation for N. This result is just a first step towards understanding 
the problem of computing the interleaving distance; we hope to address the problem 
more fully in future work. 

A Note On Prior Work 

After making the results of the first part of this thesis publicly available (in form 
of the preprint [32J ) , it was brought to our attention that in [21J, dAmico et al. 
proved an optimality result for the bottleneck distance similar to the optimality results 
given here, for the special case of 0-dimensional ordinary persistent homology. Our 
Theorem 12.10.81 generalizes a slight weakening of [2T| Theorem 32]; see Remark l2. 10.91 



CHAPTER 1. INTRODUCTION 



13 



1.4 Chapter 3: Strong and Weak Interleavings and 
Interleaving Distances for Multidimensional Fil- 
iations 

In Chapter 3 we introduce and study interleavings and interleaving distances on 
multidimensional nitrations. The first goal of Chapter 3 is to present theory for inter- 
leavings on nitrations analogous to that which Chapter 2 presents for interleavings on 
multidimensional persistence modules; the second goal of the chapter is to establish 
the technical foundations needed to formulate and prove topological inference results 
directly at the level of nitrations. 

We in fact introduce two types of interleavings on multidimensional nitrations, 
strong interleavings and weak interleavings; each induces a distance on multi- 
dimensional nitrations, the strong interleaving distance dsi and the weak inter- 
leaving distance d\yi, respectively. We define strong interleavings in a way closely 
analogous to the way in which we define interleavings on persistence modules, and 
they share some of the favorable theoretical properties of interleavings on persistence 
modules. 

However, strong interleavings turn out to be too sensitive for the purpose of prov- 
ing inference results analogous to Theorem ll.2.1l at the level of nitrations. The reason, 
put somewhat coarsely, is that strong interleavings are sensitive to the topology of 
the spaces in nitrations up to homeomorphism; in developing theory of topological 
inference, it turns out to be necessary to work with a notion of interleaving that is 
sensitive only to the homotopy type of the spaces in the nitrations. We thus intro- 
duce a homotopy theoretic variant of strong interleavings called weak interleavings, 
which we define using localization of categories, a standard construction in homotopy 
theory. We use weak interleavings and the weak interleaving distance to formulate 
our main inference results on the level of nitrations. 

Because our inference results at the level of nitrations are formulated using weak 
interleavings rather than strong interleavings, we are less interested in understanding 
strong interleavings and the strong interleaving distance than their weak counterparts. 
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Nevertheless, we believe the theory of strong interleavings is worth developing, if only 
as a bridge to developing the theory of weak interleavings. 

1.4.1 Strong Interleavings and the Strong Interleaving Dis- 
tance 

Section l3T2l presents the definition and basic theory of strong interleavings. We present 
three main results on the theory of strong interleavings. 

Our first main result, Theorem I3.2.7[ is a characterization of strongly interleaved 
pairs of filtrations of nested type — nitrations of nested type are simply nitrations, 
each of whose transition maps is an injection; all filtrations we have occasion to 
consider in the development of our inferential theory in Chapter 4 are of nested type. 

Our characterization of strongly interleaved pairs of filtrations of nested type 
is loosely analogous to our characterization Theorem 12.6.41 of interleaved pairs of 
persistence modules in Chapter 2. Whereas our characterization of interleaved pairs of 
persistence modules in Chapter 2 is given in terms of free covers of persistence modules 
(i.e. the th modules in free resolutions of persistence modules) our characterization 
of strongly interleaved pairs of filtrations of nested type is given in terms of colimits 
of filtrations. 

Theorems 13.2.101 and 13.2.1 H our second and third main results on the theory of 
strong interleavings, are optimality results for dsi, each analogous to our optimality 
result Corollary 12.10.21 for dj. We prove these using Proposition 12.10.41 (which, as 
noted above, is the main step in our proof of Corollary 12. 10.21 our optimality result for 
multidimensional persistence modules) and our characterization result Theorem 13. 2 .71 

1.4.2 Weak Interleavings and the Weak Interleaving Distance 

In Section 13.31 we define and study weak interleavings and the weak interleaving 
distance. Our definition of weak ( J 1( J 2 )-interleavings, via which we formulate our 
second main inference result, Theorem 14.4.41 is perhaps the most interesting object 
of study of this thesis. 



CHAPTER 1. INTRODUCTION 



15 



We'll now discuss at a high level the motivation for the definition of weak inter- 
leavings, the use of localization of categories in formulating the definition, and the 
theory of weak interleavings. 

Motivation For Considering Weak Interleavings 

As we have noted above, strong interleavings are too sensitive for our use in formu- 
lating our inference results. We begin by explaining the sense in which this is so: 
To adapt Theorem 11.2.11 to the level of nitrations, we want a notion of distance d 
between interleavings between nitrations such that if / : X — y Y is a morphism of 
nitrations^ which is a levelwise homotopy equivalence, meaning that each component 
map f a : X a — > Y a is a homotopy equivalence, then d(X, Y) = 0. The reason we 
want a distance on nitrations with this property is simple: To adapt the proof of 
Theorem 11.2.11 to proofs of inference results at the level of nitrations, we need an 
adaptation of the persistent nerve lemma of [13] which holds on the level of nitra- 
tions rather than only on the level of persistent homology; such an adaptation can be 
formulated in terms of a distance d on nitrations with the above property. 

When we give the definition of d$i(X, Y) in Section I3"72"t it will be easy to see that 
it is not true that dsi(X, Y) = whenever there is a levelwise homotopy equivalence 
between X and Y; see Remark 14.2.21 for a counterexample. On the other hand, our 
Proposition 14.2.11 shows that dwi does have this property. In fact, we formulate the 
definitions of weak interleavings and dwi so as to explicitly enforce the property. 

Localization of Categories and Weak Interleavings 

To define weak interleavings, we first modify the category of multidimensional nitra- 
tions by formally adjoining inverses of levelwise homotopy equivalences. The math- 
ematical tool for this is localization of categories. Localization of categories is anal- 
ogous to the localization of rings and modules in commutative algebra. In analogy 
with those versions of localization, it is characterized by a simple universal property. 



l See Section [2.7.21 for the definition of a morphism of nitrations. 
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Localization of categories is intimately connected to homotopy theory and in partic- 
ular to closed model categories and the axiomatic homotopy theory of Quillen [57] . 
We discuss this connection in Section 13.3.11 

We observe that we can define interleavings between nitrations in a localized 
category of multidimensional nitrations in much the same way that we define strong 
interleavings in the ordinary category of multidimensional nitrations. We define weak 
interleavings to be the interleavings in the localized category, and then define dwi in 
terms of weak interleavings. 

Properties of Weak Interleavings and the Weak Interleaving Distance 

Our definitions are such that dwi < dsi- On the other hand, dwi is sensitive enough 
that persistent homology functors defined over arbitrary commutative coefficient rings 
are stable, with respect to the weak interleaving distance on n-filtrations and the 
interleaving distance on persistence modules. This is the content of our Theorem 13. 3. 9 1 
As we will see, this theorem is quite useful in passing from inference results on the 
level of nitrations to inference results on the level of persistent homology. 

In this thesis, we do not prove an optimality result for dwi analogous to those 
which we prove for di and dsi- Such a result, if we had it, would offer a fuller picture 
of the sensitivity properties of dwi- Also, we do not offer a geometrically transparent 
characterization of dwi analogous either to the algebraic characterization of dj given 
by Theorem 12 . 6 . 41 or to the geometric characterization of dsi given by Theorem 13.2.71 
Thus, in spite of our Theorem 13.3.91 and the close analogy between dwi and both dsi 
and di, for which this thesis presents transparent characterizations, at the conclusion 
of this work dwi still remains somewhat of a mysterious object. An optimality result 
for dwi and a geometrically transparent characterization of dwi would do much to 
remove the shroud of mystery around dwi- In Section l3.3.7[ we discuss these problems 
further and offer a conjectural optimality result for dwi- 
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1.5 Chapter 4: The Inferential Interpretation of 
Random Cech and Rips Multifiltrations 

In Chapter 4 we present our main inference results for multidimensional persistence, 
Theorems 14.4.21 and 14.4.41 These adapt Theorem ll.2.1l to the multidimensional setting 
and to the level of nitrations. We present two main inference results: The first 
concerns inference using Cech bifiltrations and the second concerns inference using 
Rips bifiltrations. 

We also present two additional results, one deterministic and one probabilistic, 
which are similar in spirit to our main results. These are our Theorems 14.2.101 
and 14X21 

1.5.1 Definitions of Filtrations 

To state our first main inference result, we need to define a few types of filtrations. 
All definitions we present here also appear later in the thesis in fuller generality; see, 
in particular, Sections 12.71 and 13.11 

For m,d6 (— oo, oo], a (u, v )-filtration X (of nested type) is a collection of topolog- 
ical spaces {X( a £)} a <u,b<v sucn that if (a, b) < (a', b') (w.r.t. the usual partial ordering 
on IR 2 ) then X( a! fy C X^^y We can define the (multidimensional) persistent homol- 
ogy functor of a (u, v) -filtration for any (u,v) G (—oo, oo] 2 — see Section 13.1.21 

For an (oo, oo)-filtration X and u, v 6 (—00,00], let R( U)V \(X) denote the (u, in- 
filtration such that for (a, b) < (u,v), R(u,v)(X)( a ,b) — ^a,b- 

Superlevelset-Offset Filtrations 

We now introduce sublevelset-offset filtrations and superlevelset-offset filtrations. In- 
formally, whereas the topology of a superlevelset filtration encodes only the height 
of topographical features of the graph of an R-valued function, the topology of a 
superlevelset-offset filtration simultaneously encodes both the height and the width 
of those topographical features. 

Superlevelset-offset filtrations are a natural common extension of superlevelset 
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filtrations and offset filtrations [13], two types of 1-D nitrations which are standard 
objects of study in the computational topology literature. 

If (Y, d) is a metric space, X C Y, and 7 : X — > R is a function, F so (X, Y, d, 7), 
the sublevelset-offset filtration of 7 (w.r.t the metric d), is the (00, oo)-filtration 
for which 

F so (X, Y, d, T ) a , 6 = {ye Y\d(y, 7^(00, a])) < 6}. 

We write F so (Y, Y, d, 7) simply as F so (Y, d, 7). 

Informally, F (X, Y, d, j) a ,b is the thickening of the a-sublevelset of 7 by offset 
parameter b. We call F so (X, Y, d, —7). the superlevelset-offset filtration of 7. 

Superleveset-Cech Bifiltrations 

Given X, Y, and d as above and feel, let Cech (X, Y, d, b), the (closed) Cech complex 
of (X, d) with parameter 6, be the abstract simplicial complex with vertex set X such 
that for I > 2 and x±, X2, ■■, x\ G X, Cech (X, Y, d, b) contains the (/ — l)-simplex 
[xi, Xi] iff there is a point y G F such that Xj) < 6 for 1 < i < I. 

For any 7 : X — > R, let F 5<5e (X, F, d, 7), the sublevelset-Cech filtration of 7 
(w.r.t the metric d), be the (00, oo)-filtration for which 

F sGe (X } y, d, 7)0,5 = Cech (7" 1 ((oo, a]),Y, d, b). 

We call F SCe (X, Y, d, —7) the superlevelset-Cech filtration of 7. 

Superlevelset-Rips Bifiltrations 

Given a metric space (X, d) and b G R, let Rips(X, d, 6), the Rips complex of (X, d) 
with parameter b, be the maximal abstract simplicial complex with vertex set X 
such that for Xi,x 2 G X, the 1-skeleton of R(X,d,b) contains the edge [xi,^] iff 
d(xi, X2) < 2b. 

For any 7 : X — > R, let F SR (X, d, 7), the sublevelset-Rips filtration of 7 (w.r.t 
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the metric d), be the (oo, oo) -filtration for which 

F SR (X, d, 7)0,6 = Rips( 7 - 1 ((oo, a]),d,b). 
We call F SCe (X,d, —7) the superlevelset-Rips filtration of 7. 



1.5.2 An Inferential Interpretation of Superlevelset-Cech Bi- 
filtrations 

We now describe our first main inference result. 

For simplicity, we state the result here only for density functions on Euclidean 
space; it adapts readily to the case of density functions on Riemannian manifolds. 

Let 7 : lR m — > R be a c-Lipchitz density function for some c > 0; for zeff, let T z 
be an i.i.d. sample of size z of a random variable with density 7; fix p G [1, 00], let d p 
denote both the L p -metric on W 71 and (by slight abuse of notation) the restriction of 
the d p to any subset of W 71 ; let £ be a density estimator on MP 1 . 

Our first main result, Theorem I4.4.2[ is the following. 

Theorem. 

(i) If (7, E) satisfies Al then 

tM^cc^^'C^R^d^ A 0. 

(ii) If (7, E) satisfies A2 for a kernel K then 

dwi(R(o,oo)(F sGe (T z , R m , d p , -E(T Z ))), R {0>oo) (F so (R m , dP, -7 * K))) A 0. 

Note that unlike Theorem II. 2. H the statement of this result does not involve a 
sequence {S z } z£ z of scale parameters. 

Let Hi denote the i th persistent homology functor (the definition is given in 
Section I3.1.2p . From our stability result for the weak interleaving distance, Theo- 
rem I3.3.9| we immediately obtain the following corollary, which is Corollary 14.4.31 
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Corollary. 

(i) If(j,E) satisfies Al then 

di(H i (R(< d)00 )(F s ^ e {T Z) R m , <F, —E(T Z )))), Hi (R(p,oo) (E so (R m , ef, -7)))) A 0. 

(ii) If (7, E) satisfies A2 for a kernel K then 

di{Hi{R{Q y00 ){F s ^ e {T Z) R m , d p , -E{T Z )))), Hi {R(p,oo) (F so (R m , d p , -j*K)))) A 0. 

Theorem 12. 6.4[ our extrinsic characterization of (Jj, J2)-interleaved pairs of mod- 
ules, offers a concrete interpretation of Corollary 14.4.31 as a statement about the 
similarity between presentations of persistent homology modules. 

Theorem 14.4.21 and Corollary 14.4.31 also immediately give us corresponding results 
about Rips nitrations, in two special cases. First, it is easy to check that for any 
T C R m , and any scale parameter a > 0, Rips(T, d°°, a) = Cech (T, R m , d°°, a). Thus 

F sGe {T z , R m , d°°, -E{T Z )) = F SR {T Z , d°°, -E{T Z )) 

and so when p = 00, we may replace the superlevelset-Cech nitrations F SCe (T z , R m , d p , —E{T Z )) 
with the superlevelset-Rips notations F SR (T z ,d p , —E(T Z )) everywhere in the state- 
ments of the above theorems. 

Second, since the superlevelset-Rips nitrations and superlevelset-Cech nitrations 
always have equal 1-skeletons, 

H (R i0jOo) (F sGe (T z ,dP z ,-E(T z )))) = H Q (R (0tOo) (F SR (T z ,d? z ,-E(T z )))) 

in the the statement of Corollary I4.4.3[ and so the corollary gives us in particular a 
consistency result for the the th persistent homology of superlevelset-Rips nitrations. 
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1.5.3 An Inferential Interpretation of Superlevelset-Rips Bi- 
filtrations 

For 7,T 2 , and E as in the previous section, write Ff R = F SR (T z ,d p , —E(T Z )) and 
psce = p sGe (T z , M m , dP, -E(T Z )), and F so = F so (W m , R m , dP, -7))). 

Our second main inference result, Theorem I4.4.4[ is an analogue of our first main 
inference result for the random superlevelset-Rips bifiltrations F Z R . The result is 
formulated directly in terms of weak interleavings, rather than in terms of dwi- Be- 
cause of this, the precise statement of the result is technical and not readily presented 
before giving a careful treatment of weak (Ji, ^-interleavings. Thus we will describe 
the result here but defer its statement to Section 14.4.41 

Theorem 14.4.41 quantifies the sense in which F 2 is topologically similar to F so 
in the asymptotic limit as z — > 00, under the same assumptions as in our first main 
inference result. Roughly, the result says that in the limit as z — > 00, F^ R and F 
satisfy in the weak sense the same interleaving relationship that F^ R and F^ Ce satisfy 
in the strong sense for all z (with probability 1). 

The result is not, in any reasonable sense, a consistency result. Indeed, Ff Ce and 
Ff R exhibit topological differences that do not become negligible with high proba- 
bility as z 00. Thus, given that under the assumptions of our first main inference 
result, the bifiltrations F 2 are (in a topological sense) consistent estimators of F , 
we do not expect that under the same assumptions the bifiltrations F^ R would also be 
consistent estimators of F so . On the other hand, the topological differences between 
Ff and F z are controlled by the well known inclusion relationships between Cech 
complexes and Rips complexes: Recall that for any metric space (Y,d), X C Y and 
any scale parameter r > 0, we have that 

Cech (X, Y, d, r) C Rips(X, d, r) C Cech (X, F, d, 2r). (1.1) 

Using the language of weak ( Ji, J 2 )-interleavings, we can encode the relationship 
between Ff Ce and Ff R induced by these inclusions. Then, via a simple triangle in- 
equality type lemma for interleavings, we can quantify how that relationship, taken 
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together with our first main inference result, controls the topological differences be- 
tween F^ R and F so in the asymtoptic limit. Theorem 14.4.41 does exactly this. 

As in the case of our first main result, via the stability Theorem 13.3.91 we ob- 
tain an analogue of Theorem 14.4.41 for persistent homology modules, Corollary 14.4.51 
Corollary 14.4.51 is formulated in terms of (Ji, J2)-interleavings of persistence modules. 
As is the case for Corollary 14.4.31 Theorem 12.6.41 offers a concrete interpretation of 
Corollary 14.4. 5l as a statement about the similarity between presentations of persistent 
homology modules. 

1.5.4 Deterministic Approximation Results 

In proving our inference results, we follow a strategy analogous to that used by Chazal 
et. al to prove the inference result (TTj Theorem 5.1]-namely we first prove deter- 
ministic approximation results which assume that the domain of the functions we 
consider is well sampled, and then use these deterministic results to obtain proba- 
bilistic results. 

In specific, we first prove Theorem 14.2.31 a bound on the weak interleaving dis- 
tance between the sublevelset-offset filtration of a function 7 and the sublevelset-Cech 
filtration of an approximation of 7 defined on a finite subset of the domain of 7. This 
bound implies an analogous result, Theorem 14.2.71 f° r sublevelset-Rips nitrations, 
formulated in terms of weak (J 1; J 2 )-interleavings. These results are the content of 
Sections 14.2.21 and 14.2.31 they are analogues of [121 Theorem 3.1], and its extension 
[TTl Theorem 4.5], but in the multidimensional setting and for nitrations rather than 
for persistent homology modules. In fact, our results treat the general situation that 
7 is an W 1 - valued function, n > 1, and that only a sublevelset of the domain of 7 is 
well sampled. 

The result [TTl Theorem 4.5] for ordinary persistence is interesting and useful, in- 
dependent of its application to statistical inference — see, for example, the application 
presented in [T3]. We imagine that the deterministic bounds we present here may 
similarly be of independent interest. 
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1.5.5 Deterministic Approximation of Multidimensional Sub- 
levelset Persistent Homology 

Using the interleaving distance on multidimensional persistence modules, it also is 
possible to adapt the deterministic result [TH Theorem 4.5] to the multidimensional 
setting in a different — and more straightforward — way than that of Theorems 14.2.31 
and 14.2.71 Our Theorem 14.2.101 generalizes the result [HI Theorem 4.5] to M™- valued 
functions, using sublevelset multifiltrations (as defined e.g. in Section 12. 7.4|) . and 
sublevelset-Rips multifiltrations with a fixed scale parameter. The proof is essentially 
same as that of [H] Theorem 4.5] in the case of K- valued functions. 

This result does not lift to the level of nitrations, but an analogue of it for 
sublevelset-Cech nitrations rather than sublevelset-Rips nitrations does in fact lift 
to the level of nitrations. This is our Theorem 14.2.111 

1.5.6 An Inference Result for 1-D Filtrations 

In addition to our main inference results, described above, in Section 1431 we apply the 
weak interleaving distance and Theorem 14.2.111 to obtain an inference result, Theo- 
rem 14.5.2^ for the superlevelset filtration of a probability density function. This is a 
variant of Theorem 1 1 . 2 . 1| holding at the level of filtrations. In general, the result holds 
only for filtered Cech complexes, not for filtered Rips complexes — Theorem ll.2.1[ as 
stated above for filtered Rips complexes, does not lift to the level of filtrations. How- 
ever, as for Theorem 14.4. 2\ Theorem 14.5.21 can of course be interpreted as a result 
about Rips filtrations in the special case that we construct our filtrations using L°° 
metrics on M. m . 

As we have already noted, our proof of Theorem 14.5.21 adapts to give a proof of 
Theoem II. 2. 1^ which we stated but did not prove. 



CHAPTER 1. INTRODUCTION 



24 



1.6 Why Formulate Theory of Topological Infer- 
ence Directly at the Level of Filtrations? 

To close this introduction, we explain our choice to develop inferential theory directly 
at the level of filtrations rather than only at the level of persistent homology modules. 

Our aim is to understand as deeply as possible the connection between the topo- 
logical structure of discrete filtrations built on randomly sampled point cloud data 
and that of filtrations built directly from the probability distribution generating the 
random data; as we'll now argue, to do this, it is more natural to develop the in- 
ferential theory (and, in particular, the requisite notions of similarity of filtrations) 
directly at the level of filtrations than at the homology level. 

We note first the trivial fact that singular homology induces an equivalence re- 
lation on topological spaces — namely, we can say that two topological spaces X an 
Y are homology equivalent if H^X) and H%{Y) are isomorphic for all i. However, 
such a notion of equivalence between topological spaces is not often adopted as a 
fundamental object of interest in algebraic topology. The reason is that there are 
stronger, more easily defined, and more geometrically transparent notions of equiva- 
lence of topological spaces, such as those of homotopy equivalence and weak homotopy 
equivalence, which exhibit the kinds of invariance properties one wants a notion of 
equivalence on spaces to have in algebraic topology and homotopy theory^] Thus 
homotopy equivalence and weak homotopy equivalence are generally regarded as the 
fundamental notions of equivalence of topological spaces in algebraic topology, and 
homology serves primarily as a computational tool to understand spaces up to these 
notions of equivalence; homology is usually not taken as means of defining a notion 
of equivalence of topological spaces in of itself. 

Now, for filtrations the analogue of homology is persistent homology, and we can 

5 Note: This not standard terminology; we are introducing it here only in the service of the present 
discussion and only for use in this subsection. 

6 When we say that homotopy equivalence is a geometrically transparent notion of equivalence 
of topological spaces we are alluding specifically to the theorem which says that two spaces are 
homotopy equivalent if and only if they are each deformation retracts of some third space |29[ 
Corollary 0.21]. 
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define a notion of persistent homology equivalence of nitrations in a way analogous to 
the way in which we just defined homology equivalence of topological spaces. In fact, 
for ordinary persistence the bottleneck distance ^ on persistence modules affords us 
an approximate notion of persistent homology equivalence between two 1-D nitrations: 
we can interpret the statement that 

V*GZ> , d B (Hi{X),Hi(X))<e 

for two nitrations X and Y as saying that X and Y are approximately persistent 
homology equivalent, up to an error of e. More generally, the interleaving distance in- 
troduced in this thesis affords us in the same way an approximate notion of persistent 
homology equivalence between two multidimensional nitrations. 

Given that for topological spaces it turns out to be most natural to regard ho- 
motopy equivalence, rather than homology equivalence, as the fundamental notion of 
equivalence between topological spaces, by analogy it is reasonable for us to ask for a 
notion of approximate homotopy equivalence of nitrations which has good invariance 
properties and which enjoys the same advantages over approximate homology equiv- 
alence that homotopy equivalence enjoys over homology equivalence for topological 
spaces. Further, if we were to have such a notion, then in light of its good properties 
and by analogy to classical algebraic topology, it would be natural for us to formulate 
persistence theory and in particular theory of topological inference directly at the 
level of nitrations in terms of that notion. 

The weak interleaving distance affords us precisely such a well behaved notion 
of approximate homotopy equivalence of multidimensional nitrations, except that, as 
we noted in Section 11.4.21 it is not yet clear to what extent the weak interleaving 
distance is in fact geometrically transparent. 

Thus, the present unavailability of a geometric characterization of the weak inter- 
leaving distance aside, it is natural to formulate our main inference results directly 
at the level of nitrations, in terms of the weak interleaving distance and the closely 
related notion of weak interleavings. 



Chapter 2 

Interleavings on Multidimensional 
Persistence Modules 

. In this chapter we introduce and study interleavings and interleaving distance on 
multidimensional persistence modules. See Section [L3l for an overview of the chapter. 

2.1 Algebraic Preliminaries 

In this section we define persistence modules and review some (primarily) algebraic 
facts and definitions which we will need throughout the thesis. 

2.1.1 First Definitions and Notation 

Basic Notation 

Let k be a field. Let N denote the natural numbers and let Z>o denote the non- 
negative integers. Let R = (—00,00]. 

We view R n as a partially ordered set, with (ai, a n ) < (bi, b n ) iff dj < foj for 
all i. Let denote the i th standard basis vector in W 1 . 

For A C R any subset, let A = A U {—00, 00}. 

For a = (ai, a n ) e R n and b e R, let (a, b) = (a u a n , b) e W l+1 . 
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For / : X ->■ R n , a G R n , let / = {x G X|/(x) < a}. We call f a the a-sublevelset 
of f and we call (—f) a the a-superlevelset of f. 

For eel, let e* n G M n denote the vector whose components are each e. We'll also 
often write e n simply as e when n is understood. As a notational convenience, for 
u G W l and e G M, let w + e denote u + e '. 

Notation Related to Categories 

For a category C, let obj(C) denote the objects of C and let obj*(C) denote the 
isomorphism classes of objects of C. Let hom(C) denote the morphisms in C, and 
for X, Y G obj(C) let homc(X, Y) denote the morphisms from X to Y. When C is 
understood, we'll often write homc(X, Y) simply as hom(X, Y). 

Metrics, Pseudometrics, and Semi-pseudometrics 

Recall that a pseudometric on a set X is a function d : X x X — > [0, oo] with the 
following three properties: 

1. d(x, x) = for all x G X . 

2. y) = d(y, x) for all x, y G X. 

3. <i(x, z) < y) + d(y, z) for all x,y,z G X. 

We'll often use the term distance in this thesis as a synonym for pseudometric. 

A metric is a pseudometric d with the additional property that d(x, y) ^ when- 
ever x / i/. We define a semi-pseudometric to be a function rf:lxl-} [0, oo] 
satisfying properties 1 and 2 above. 

Metrics on Categories 

In this thesis we'll often have the occasion to define a pseudometric on obj*(C), 
for C some category. For d such a pseudometric, M, N G obj(C), and [M],[7V] 
the isomorphism classes of M and N, we'll always write d(M, N) as shorthand for 
d([M],[N}). 
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2.1.2 Commutative Monoids and Commutative Monoid Rings 

Monoid rings are generalizations of polynomial rings. 

A commutative monoid is a pair (G, +g), where G is a set and +g is an associative, 
commutative binary operation on G with an identity element. Abelian groups are by 
definition commutative monoids with the additional property that each element has 
an inverse. We'll often denote the monoid (G, +g) simply as G. A submonoid of a 
monoid is defined in the obvious way, as is an isomorphism between two monoids. 

Given a set S, let k[S] denote the vector space of formal linear combinations of el- 
ements of S. If G = (G, +g) is a monoid, then the operation + G induces a ring struc- 
ture on k[G], where multiplication is characterized by the property {kigi)(k 2 g2) = 
kik 2 (gi +g 92) for fci, k 2 G k, #1, #2 £ G. We call the resulting ring the monoid ring 
generated by G, and we denote it k[G). 

Let A n denote k[x±, ...,x n ], the polynomial ring in n variables with coefficients in 
k. For n > 0, Z> is a monoid under the usual addition of vectors. It's easy to see 
that fc[Z> ] is isomorphic to A n . 

Similarly, M> is a monoid under the usual addition of vectors. Let B n denote the 
monoid ring fc[R> ]. We may think of B n as an analogue of the usual polynomial ring 
in n- variables where exponents of the indeterminates are allowed to take on arbitrary 
non-negative real values rather than only non-negative integer values. With this 
interpretation in mind, we'll often write (r 1; ...,r n ) as x T ^x r 2 ■ ■ -x^ 1 , for (r 1; ...,r n ) G 

2.1.3 Multidimensional Persistence Modules 

We first review the definition of a multidimensional persistence module given in [7]. 
We then define analogues of these over the ring B n . 

In what follows, we'll often refer to multidimensional persistence modules simply 
as "persistence modules." 
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A„-Persistence Modules 

Fix n G N. An A„-persistence module is an A„-module M with a direct sum 
decomposition as a A;-vector space M = aeZ n M a such that the action of A n on M 
satisfies Xj(M ) C M a+Gi for all a G Z n . In other words, an A n -persistence module is 
simply an A n -module endowed with an n-graded structure. 

For M and N A n -persistence modules, we define hom(M, N) to be the set of 
module homomorphisms / : M — > N such that /(M ) C N a for all a G Z n . This 
defines a category whose objects are the A n -persistence modules. Let A n -mod denote 
this category. 

5 ra -persistence modules 

In close analogy with the definition of an n-graded A n -module, we define a B n - 
persistence module to be a -B n -module M with a direct sum decomposition as a 
A;- vector space M = agR n M a such that the action of B n on M satisfies x°(M a ) C 
M a+aei for all a G W 1 , a > 0. 

For M and _B n -persistence modules, we define hom(M, N) to consist of module 
homomorphisms / : M ->■ N such that f(M a ) C N a for all a G R n . This defines 
a category whose objects are the £> n -persistence modules. Let -B n -mod denote this 
category. 

Our notational convention will be to use boldface to denote A n -persistence mod- 
ules and italics to denote -B n -persistence modules. We'll often refer to Ai-persistence 
modules and ^-persistence modules as ordinary persistence modules. 

On the Relationship Between A n -persistence Modules and £? n -persistence 
Modules 

Since A n is a subring of B n , we can view B n as an A n -module. If M is an An- 
persistence module then M ®a„ B n is a S n -module. Further, M ®a„ B n inherits 
an n-grading from those on B n and M which gives M ® An B n the structure of a 
-B n -persistence module. 

In fact, (•) ®a„ B n defines a functor from A n -mod to _B Tt -mod. It can be checked 
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that this functor is fully faithful and descends to an injection on isomorphism classes 
of objects. Thus the functor induces an identification of A„-mod with a subcategory 
of B n -mod. 

In light of this, we can think of -B n -persistence modules as generalizations of A n - 
persistence modules. Finitely presented _B n -persistence modules arise naturally in 
applications, as discussed in Section 12 .7.41 In a sense that can be made precise using 
machinery mentioned in Remark 12.3.31 it is possible to view them as A n -persistence 
modules endowed with some additional data. However, this is awkward from the 
standpoint of constructing pseudometrics between S n -persistence modules. We thus 
regard £> n -persistence modules as the fundamental objects of interest here, and use 
A„-persistence modules in this thesis only in the case n = 1 to translate results about 
Ai-persistence modules into analogous results about ^-persistence modules. 

In the remainder of Section I2.1.3[ we present some basic definitions related to 
-B n -persistence modules. All of these definitions have obvious analogues for An- 
persistence modules; we'll use these analogues where needed without further com- 
ment. 

Homogeneity 

Let M be a 5 n -persistence module. For u G M n , we say that M u is a homogeneous 
summand of M. We refer to an element v G M u as a homogeneous element of grade 
u, and write gr(v) = u. A homogeneous submodule of a _B n -persistence module is 
a submodule generated by a set of homogeneous elements. The quotient of a -Bn- 
persistence module M by a homogeneous submodule of M is itself a _B n -persistence 
module; the n-graded structure on the quotient is induced by that of M. 

Tameness 



Following [8] we'll call a -Bn-persistence module tame if each homogeneous summand 



of the module is finite dimensional. Note that this is a more general notion of tameness 
than that which appears in the original paper on the stability of persistence [14"] . 
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Transition Maps 

For M a -B„-persistence module and any u < v <E W 1 , the restriction to M u of 
the action on M of the monomial a^ 1- " 1 ^ 2- " 2 • • ■ x^~ Un defines a linear map with 
codomain M v , which we call a transition map. Denote this map by ipm(u,v). 

2.1.4 Shift Functors and Transition Morphisms 
Shift Functors and Related Notation 

For v G R n , we define the shift functor (-)(i>) as follows: For M a -B n -persistence 
module we let M(v) be the _B n -persistence module such that for all a G M™, M(v) a = 
M a+V . For a < b G M n , we take ipM(v)( a , b) = <fM(a + v,b + v). For / G hom(5 n -mod) 

We let f(v) a = fa+v 

For v G W l and / G hom(_B„-mod), we'll sometimes abuse notation and write f{v) 
simply as /. 

For e G R, let M(e) denote M{e). More generally, for any subset Q C M, let 
Q(e) C M(e) denote the image of Q under the bijection between M and M(e) induced 
by the identification of each summand M(e) u with M u+e . 

Transition Homomorphisms 

For a 5 n -persistence module M and e G M> let S(M, e) : M ->■ M(e), the (diagonal) 
e-transition homomorphism, be the homomorphism whose restriction to M u is the 
linear map <^m(w, u + e) for all u G M n . 

2.1.5 e-interleavings and the Interleaving Distance 

For e > 0, we say that two £?„-persistence modules M and are e-interleaved if 
there exist homomorphisms / : M — >■ N(e) and g : N ^ M(e) such that 

g(e)of = S(M,2e) and 
/(e)o£ = S(iV,2e); 
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we refer to such / and g as e-interleaving homomorphisms. 

The definition of e-interleaving homomorphisms was introduced for .Bx-persistence 
modules in [SJ. 

Remark 2.1.1. It's easy to see that if < ex < t2 and M and N are ei-interleaved, 
then M and N are e2-interleaved. 

In Section 12.6.11 we will observe that the definition of e-interleavings generalizes 
considerably, but for now we will not concern ourselves with the generalized form of 
the definition. 

The Interleaving Distance on i?„-persistence modules 

We define dj : obj*(-B n -mod) x obj*(-B ri -mod) — > [0, oo], the interleaving distance, 
by taking 

di(M, N) = inf{e € M> |M and N are e- interleaved}. 

Note that dj is pseudometric. However, the following example shows that di is 
not a metric. 

Example 2.1.2. Let M be the ^-persistence module with Mq = k and M a = 
if a 7^ 0. Let N be the trivial -Bi-persistence module. Then M and iV are not 
isomorphic, and so are not 0-interleaved, but it is easy to check that M and iV are 
e- interleaved for any e > 0. Thus dj(M, N) = 0. 

2.1.6 Free ^-persistence Modules, Presentations, and Re- 
lated Algebraic Basics 

In our study of _B n -persistence modules, we'll make substantial use of free S n -persistence 
modules and presentations of £> n -persistence modules; we define these objects here 
and present some basic results about them. We begin with some foundational defini- 
tions. 
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n-graded Sets 

Define an n-graded set to be a pair G = (G, lq) where G is a set and lq '■ G — > M. n is 
any function. When iq is clear from context, as it will usually be, we'll write lg{v) 
as gr(y) for y G G. We'll sometimes abuse notation and write G to mean the set G 
when no confusion is likely. The union of disjoint graded sets is defined in the obvious 
way. For e > and G = (G, Lq) an n-graded set, let G(e) be the n-graded set (G, i' G ), 
where i' G {y) = i(y) - e. 

For G an n-graded set, define gr(G) : M. n — > Z> U {oo} by taking gr(G){u) 
to be the number of elements y G G such that gr(y) = u. Note that for any B n - 
persistence module M, a set Y C M of homogeneous elements inherits the structure 
of an n-graded set from the graded structure on M, so that gr(Y) is well defined. 

Free Z? n -persistence modules 

The usual notion of a free module extends to the setting of _B n -persistence modules 
as follows: For G an n-graded set, let (G) = © ye g-B n (— gr{y)). A free B n -persistence 
module F is a ^-persistence module such that for some n-graded set G, F = (G). 

Equivalently, we can define a free _B n -persistence module as a 5 n -persistence mod- 
ule which satisfies a certain universal property. Free A n -persistence modules are de- 
fined via a universal property e.g. in [7J Section 4.2]. The definition for £? n -persistence 
modules is analogous; we refer the reader to [7] for details. 

A basis for a free module F is a minimal set of homogeneous generators for F. 
For G any graded set, identifying y G G with the copy of 1(— gr(y)) in the summand 
B n (—gr(y)) of (G) corresponding to y gives an identification of G with a basis for 
(G) . It can be checked that if B and B' are two bases for a free _B n -persistence module 
F then gr(B) = gr(B'). Clearly then, gr(B) of an arbitrarily chosen basis B for F is 
an isomorphism invariant of F and determines F up to isomorphism. 

For R a homogeneous subset of a free S n -persistence module F, (R) will always de- 
note the submodule of F generated by R. Since, as noted above, R can be viewed as an 
n-graded set, we emphasize that for such R, (R) does not denote ® y ^RB n {—gr{y)). 
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Free Covers and Lifts 

For M a £> n -persistence module, define a free cover of M be a pair (F M ,p M ), where 
Fm is a free I? n -persistence module and pu '■ Fm — > M a surjective morphism of 
-B n -persistence modules. 

For M, A/" -B n -persistence modules, (F M ,p M ) and (F N ,p N ) free covers of M and 
N, and / : M — > N a morphism, define a Zz/t of / to be a morphism / : Fm — > F^ 
such that the following diagram commutes. 



'M 



f 



PM 



Pn 



M - 

Lemma 2.1.3 (Existence and Uniqueness up to Homotopy of Lifts). For B n -persistence 
modules M and N, free covers (Fm, Pm), {Fn, Pn) of M, N, and a morphism f : M — )■ 
£/iere exists a lift f : Fm —> F^ of f . If f : Fm — > F^ is another lift of f , then 
im(f-f) C ker(pjv). 

Proof. This is just a specialization of the standard result on the existence and ho- 
motopy uniqueness of free resolutions [251 Eisenbud A3. 13] to the th modules in free 
resolutions for M and N. The proof is straightforward. □ 



Presentations of £? n -persistence Modules 

A presentation of a _B n -persistence module M is a pair (G, R) where G is an n-graded 
set and R C (G) is a set of homogeneous elements such that M = (G) / (R) . We denote 
the presentation (G,R) as (G\R). For n-graded sets Gi, ...,Gi and sets R\, ...,R m C 
(Gi U ... U Gi), we'll let (Gi, Gi\Ri, R m ) denote (Gi U ... U Gi\R x U ... U R m ). 

If M is a -B n -persistence module such that there exists a presentation for 
M with G and R finite, then we say M is finitely presented. 
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Minimal Presentations of _B„-persistence Modules 

Let M be a _B n -persistence module. Define a presentation (G\R) of M to be minimal 
if 

1. the quotient (G) — >■ (G) / (i?) maps G to a minimal set of generators for (G) / (R) . 

2. R is a minimal set of generators for (R). 

It's clear that a minimal presentation for M exists. 

Theorem 2.1.4. If M is a finitely presented B n -persistence module and (G\R) is a 
minimal presentation of M, then for any other presentation (G'\R') of M, gr(G) < 
gr(G') and gr{R) < gr(R'). 

Note that the theorem implies in particular that if (G\R) and (G'\R') are two 
minimal presentations of M then gr(G) = gr(G') and gr(R) = gr(R'). 

We defer the proof of the theorem to Appendix IA.21 The proof is an adaptation to 
our setting of a standard result [251 Theorem 20.2] about free resolutions of modules 
over a local ring. The main effort required in carrying out the adaptation is to prove 
that the ring B n has a property known as coherence; we define coherence and prove 
that B n is coherent in Appendix IA.ll 

2.2 Algebraic Preliminaries for 1-D Persistence 

In this section, we review algebraic preliminaries and establish notation specific to 
1-D persistent homology. This material will be used in Sections 12.31 and [2.41 to develop 
the machinery needed to prove Theorem 12.4.21 

Basic Notation 

For S any subset of M. 2 , let S + = {(a, b) G S\a < b}. For S a set and / : S — > K a 
function, let supp(/) = {s£ S\f(s) ^ 0}. 
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2.2.1 Structure Theorems For Tame Ai-Persistence Modules 

The structure theorem for finitely generated Ai persistence modules is well known 
in the applied topology community. In fact, this theorem generalizes to tame Ai- 
modules. The existence portion of the generalized theorem is given e.g. in [45]; the 
uniqueness is not mentioned there but is very easy to show; we do so below. To our 
knowledge, this generalization has not previously been discussed in the computational 
topology literature. We will use the more general theorem to show that the bottleneck 
distance is equal to the interleaving distance for ordinary persistence. 

Before stating the results, we establish some notation. For a < b G Z, Let C(a, b) 
denote the module (k[x]/(x b ~ a ))(— a). Let C(a, oo) = k[x](—a). Note that for fixed b 
(possibly infinite), the set of modules {C(a, &)} a e(-oo,b) has a natural directed system 
structure; let C(— oo, b) denote the colimit of this directed system. 

For M a module and m e Z> , let M m denote the direct sum of m copies of M. 

Theorem 2.2.1 (Structure Theorem for finitely generated Ai-persistence modules 
[46]). Let M be a finitely generated Ai-module. Then there is a unique function 
V M : (Z x Z) + — > Z> with finite support such that 

M^( a , i)aup p( 1)M )C(a,^(»- 6 '. 

Theorem 2.2.2 (Structure Theorem for tame Ax-persistence modules [45J). Let M 
be a tame K\-module. Then there is a unique function Dm : Z^_ — > Z> such that 

M = © (a , 6)esupp(7 , M) C(a,6)^^. 

The uniqueness part of Theorem l2.2.2l is an immediate consequence of the following 
lemma, upon noting that the right hand sides of the equations in the statement of 
the lemma do not depend on £>m- 

Lemma 2.2.3. Let M be a tame Ai-module, and let Dm : Z^_ — > Z>o be a function 
such that M = © (a ,6) e supp(© M )C(a, b) v ^ b \ Then 
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(i) For (a,b) G Z 2 + , 

T>M{a, b) = rank(^ M (a, b— 1))— rank(< / 9 M (a, b))— rank(<^ M (a— 1, b— l))+rank(<^M(a 

(ii) Forb G Z, X> M (-oo,6) = lim a ^_oo rank(^ M (a, 6-l))-lim _ + _ 0Q rank(^ M (a, &))■ 
(mj For a G Z ; D M (a, oo) = Hindoo rank(<^ M (a, 6)) — lim^oo rank(<y2 M (a — 1, b)). 
(iv) V M (-oo, oo) = lim a _ > _ 00 lim 6 _ ) . 00 rank(<^ M (a, 6)). 

Proof. This is trivial. □ 

We call Pm the (discrete) persistence diagram of M. 

In Section 12.31 we prove a structure theorem analogous to Theorem 12.2.21 for a 
subset of the tame ^-persistence modules which contains the finitely presented B\- 
persistence modules. We do not address the problem of generalizing this structure 
theorem to the full set of tame -Bi-persistence modules, but to echo a sentiment 
expressed in [8] , it would be nice to have such a result. 

2.2.2 Discrete Persistence Modules. 

In order to define persistence diagrams of .^-persistence modules, we need a mild 
generalization of Ax-persistence modules. 

Let S C R be a countably infinite set with no accumulation point. The authors 
of [8] define a discrete persistence module Ms to be a collection of vector spaces 
{M s } s( z S indexed by S together with linear maps {ipM s ( s i> s 2)}si<s 2 eS'- 

Define a grid function t : Z — > K to be a strictly increasing function with no 
accumulation point. 

Remark 2.2.4. Discrete persistence modules are of course closely related to Ai- 
persistence modules. A countably infinite subset of S C R with no accumulation 
point can be indexed by a grid function t with image S, and such a grid function is 
uniquely determined by the value of t(0). Thus, pairs (Ms, s), where Ms is a discrete 
persistence module and s is an element of S, are equivalent to pairs (M', t), where M' 
is an Ai-persistence module and t is a grid function; there is an equivalence sending 
each pair (M s ,s) to the pair (M',t), where t is a grid function with im(t) = S, 
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t(0) = s, and M' is the Ax-persistence module such that for z G Z, TS/L' Z = Mt< z ) and 
Vm'(z 1 ,z 2 ) = (PM a (t(zi),t(z2)). 

As a matter of expository convenience, from now on we'll define discrete modules 
to be pairs (M, t) where M is an Ax-persistence module and t is a grid function. This 
in effect means we are carrying around the extra data of an element of S in our discrete 
persistent modules relative to those defined in [Sj, but this won't present a problem-in 
particular, the definition of the persistence diagram of a discrete persistence module 
that we present below is independent of the choice of this element, and is equivalent 
to that of 0. 



2.2.3 Persistence Diagrams and the Bottleneck Distance 

The definition of a persistence diagram that we present here differs in some cosmetic 
respects from that in [8]. Our choice in this regard is a matter of notational conve- 
nience; the reader may check that our definition of the bottleneck distance between 
tame .^-persistence modules is equivalent to that of [8]. 
For a grid function t, define t : Z — > R as 



t(z) 



t(z) if z e Z, 
— oo if z = — oo, 
oo if z = oo. 



Let t x t : Z 2 + -> R 2 + be defined by t x t(a, b) = (t(a),t(b)). 

We define a persistence diagram to be a function T> : — > Z> . 

For (M, t) a discrete persistence module, define ^(m,*), the persistence diagram 
of (M, t), to be the persistence diagram for which supp(P(M,t)) —ix t(supp(2\i)) 
and so that T>(M,t)(i( a ), t(b)) = Dm(i^) for all (a, b) e Z^_. 



Bottleneck Metric 

For x G R, define x + oo = oo and x — oo = — oo. Then the usual definition of I 
norm on the plane extends to R 2 ; we denote it by || ■ W^. 
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Now define a multibijection between two persistence diagrams T> 1 ,T> 2 to be a 
function 7 : supp(X>i) x supp(X> 2 ) — > ^>o such that 

1. For each x G supp(X>i), the set {y G supp(V 2 )\(x,y) G supp(7)} is finite and 

^i(^) = XI 7(^,2/), 
j/esupp(r>2) 

2. For each y G supp(£> 2 ), the set G supp(X>i)|(x, y) G supp(7)} is finite and 

V 2 {y)= 7(^,2/). 

zGsupp('Di) 

For persistence diagrams T>i, T> 2 , let L(Z>i, X> 2 ) denote the triples (T>[, T>' 2) 7), where 
T>[, and P 2 are persistence diagrams with T>[ < T>i, T>' 2 < T> 2 , and 7 is a multibijection 
between T>[ and T> 2 . 

We define the bottleneck metric d B between two persistence diagrams V 1 ,T> 2 



as 



dsi'Di, V 2 ) — inf max 



( | \ 

sup -(b — a), sup ||y — x 

(a,6)Gsupp(Z>i-£>i) ^ (x,j/)gsupp(7) 



eL(Di,D 2 ) \ U supp(Z> 2 -Z>£) / 

Discretizations of i^-modules 

Let t be a grid function. For M a .Si-persistence module, we define the t-discretization 
of M to be the discrete persistence module (P t (M),t) with P t (M) defined as follows: 

1. For z G Z, P t (M) a = M t{z) ; let X M ,t, 2 : Pt(M) z ->■ M tW denote this identifica- 
tion. 

2. For y,z£li,y<z, VFt{M) {y,z) = X^ tz o ip M (t(y),t(z)) ol MAy . 
Persistence diagrams of fix-persistence modules 

We'll say a grid function t is an e-cover if for any oGl, there exists b G im(t) with 
\a — b\ < e. Now fix a G K and let {U}^ be a sequence of grid functions with ti a 
l/2 l -cover. 
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It is asserted in [8] that for any tame -Bi-persistence module M the persistence 
diagrams X , (p t .(jvr),t < ) converge in the bottleneck metric to a limiting persistence di- 
agram T>m and that V M is independent of the choice of the sequence {tj}. We call 
T>m the persistence diagram of M. For M and N tame -Bi-persistence modules, we 
define d B {M,N) = d B (V M ,V N ). 

Remark 2.2.5. Two non-isomorphic tame -Bi-persistence modules can have identical 
persistence diagrams. For example, take M and N to be the -Bi-persistence modules 
of Example 12.1.21 M and N are not isomorphic but it is easy to check that they have 
the same persistence diagram. Thus ds defines a pseudometric (but not a metric) on 
isomorphism classes of tame -Bi-persistence modules. 

2.3 A Structure Theorem for Well Behaved B\- 
persistence modules 

In this section we prove an analogue of Theorem 12.2.21 for a certain subset of the tame 
-Bi-persistence modules which we call the well behaved persistence modules. The set of 
well behaved persistence modules contains the set of finitely presented -Bi-persistence 
modules. These modules are in a sense "essentially discrete." Indeed, they are exactly 
the -Bi-persistence modules that are the images of tame Ai-persistence modules under 
a certain family of functors from Ai-mod to -Bi-mod. 

Our strategy for proving the structure theorem for well behaved persistence mod- 
ules is to exploit Theorem \2.2.2\ taking advantage of the functorial relationship be- 
tween Ai-persistence modules and well behaved Si-persistence modules. 

2.3.1 Well Behaved Persistence Modules 

A critical value of a -Bi-persistence module M is a point a G M such that for no 
e G M>o is it true that for all u < v G [a — e, a + e], (Pm(u, v) is an isomorphism. 
We'll say a tame -Bi-persistence module M is well behaved if 

1. The critical values of M are countable and have no accumulation point. 
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2. For each critical point a of M, there exists e > such that (pM(a,y) is an 
isomorphism for all y G [a, a + e] . 

Proposition 2.3.1. A finitely presented Bi-persistence module is well behaved. 

Proof. Let M be a finitely presented -E>i-persistence module and let U C R be the set 
of grades of the generators and relations in a minimal presentation for M. (It follows 
from Theorem 12.1.41 that U is well defined). Lemma [2.5.41 below tells us that for any 
a < b G R such that (a, b] D U — 0, <pm(o,, b) is an isomorphism. Since Z7 is finite, the 
result follows immediately. □ 

Let t be a grid function. Define t -1 : R — > Z by t~ l (y) = max{z G Z|t(,z) < y}. 
Define r 1 : R -> Z by 



ifwGR, 
— oo if u = — oo, 
oo if u — oo. 



We'll now define a functor ^ : Ai-mod— > i?i-mod as follows: 

1. Action of E t on objects: For M an Ai-persistence module and mgR, E t (M.) u = 
Mf-i^j; let JM,t,u '■ E t (M.) u — > M(-i/ u ) denote this identification. For u,oel, 

U < V, let (p Et (M)(u,v) = JM,t,v ° ^M^^ 1 ^),^ -1 ^)) O J M ,t,u- 

2. Action of E t on morphisms: For M and N Ax-persistence modules and / G 
hom(M,N), define E t (f) : E t (M) -> £ t (N) by letting £? t (/)„ = ^o/ H(s) o 
Jwi,t,u fc> r all m G R. 

We leave to the reader the easy verification that E t is in fact a functor with target 
-Bi-mod. 

It's clear that if M is a tame Ai-persistence module, then for any grid function 
t, E t (M.) is tame. Moreover, it's easy to check that for any grid function t and any 
tame Ai-persistence module M, E t (M.) is well behaved. 

Conversely, we have the following: 



CHAPTER 2. INTERLEAVINGS ON MULTI-D PERSISTENCE MODULES 42 



Proposition 2.3.2. If M is a well behaved Bi-persistence module, then there is some 
tame Ai-persistence-module M and some grid function t such that M = E t (M). 

Proof. Let t : Z — > R be a grid function whose image contains the critical points of 
M. Let (M, t) denote the t-discretization of M, as defined in Section l2.2.3l M clearly 
is tame. We'll show that M = E t (M). 

For u G R, define a u : E t (M) u -> M u by a u = tp M (t o o I M ,t,t- 1 (w) ° 

J~bA,t,u- By definition, jM,t,u an d lM,t,t~ l {u) are isomorphisms. Moreover, a simple 
compactness argument shows that since M is well behaved, <^m(£ ° is an 

isomorphism. Thus o~ u is an isomorphism. 

We claim that the collection of maps {cr u } ue K defines an isomorphism of modules. 
To see this, we need to show that for all u, v e R, u < v, cr v o{p Et ^ M ^u, v) = <£m(u, v) o 

0- v ° V-Bt(M) (w, f) 

= <p M (t o t _1 (f),t;) ol MAt -i W o Jm.m o ,7m^„ o (^m^ -1 ^),* -1 ^)) o 

= (f M {t O r 1 ^),!!) O X M ,t,t-Hv) ° ^M,t,t-^(v) 

o <p M (i o ^(m),^ t -1 (u)) ol MAt -i {u) o Jm,^ 
= v9 A f(> ° iT 1 ^), o <^ M (t o r l {u),to r l (v)) o%-i(„) o 

= <£>Af(* ° ^M,t,t~ 1 (u) ° t7M,t,tt 

= ¥>m(«, U) ° <^m(* ° t^iu), U) O I Mit)t -l(„) o 

Remark 2.3.3. The material above can be adapted with only minor changes to 
the setting of _B n -persistence modules, where it sheds some light on the relationship 
between A n -persistence modules and £> n -persistence modules. Namely, the definitions 
of a well behaved persistence module, grid function, and the functors E t generalize to 
the multidimensional setting, and analogues of Propositions 12.3.11 and 12.3.21 hold in 
that setting. It can be shown that the functor (•) ®a„ B n mentioned in Section [2. 1.31 
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is naturally isomorphic to a generalized functor E t . The generalization of the above 
material also can be used to translate algebraic results about A n -persistence modules 
into analogous results about _B n -persistence modules. For example, it can be used to 
show that any finitely presented _B n -persistence module has a free resolution of length 
at most n-that is, an analogue of the Hilbert syzygy theorem holds for 5 n -persistence 
modules. 

However, as we have no immediate need for the generalization or its consequences 
in this thesis, we omit it. 



2.3.2 The Structure Theorem 



First, note that for any a G M>o, k[[a, oo)] (as defined in Section f2 . 1 . 2 p is an ideal of 
B 1 . 

For a < b G E, let C(a,b) denote (Bi/k[[b — a, <x>)])(— a); let C(a, oo) denote 
B±(— a). In analogy to the discrete case, for fixed b (possibly infinite), the set of 
modules {C(a, b)\a G M, a < b} has a natural directed system structure; let C(— oo, b) 
denote the colimit of this directed system. 

Lemma 2.3.4. Let M be a well-behaved persistence module and let V be a persistence 
diagram such that M = (B( a ,b)esu PP (v)C '(a, b) v ( a,b \ Then V M = V. 

Proof. Let 

A = {ae R\(a, b) G supp(P) for some b G R}U{b G R\(a, b) G supp(P) for some a G 1}. 

Let t be a grid function such that A C im(i). We claim that P(p t (M),t) — Since 
supp(r') G im(t x t), this is true if and only if 2}p t (M)(j/, z ) — T^^{y)i t( z )) f° r a ^ 

(y,z)e1 2 + . 

To show that Vp t (M){y, z) = ^^iy)^^)) f° r & H {Vi z ) G Zq_, we'll need the 
following analogue of Lemma 12.2.31 

Lemma 2.3.5. Let M,T>, and t be as above. 
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(i) For(y,z)eZ 2 + , 

V(t(y),t(z)) = mnk( m (t(y),t(z - 1))) - raok(^(t(y), t(z))) 

- mnk(<p M (t(y - l),t(z - 1))) + mnk(<p M (t(y - l),t(z))). 

(ii) For z£Z, 

V(—oo,t(z))= lim rsink((fM(t(y),t(z — l)))— lim Tank(ipM(t(y),t(z))). 

y—t—oo y—t—oo 

( in ) For i/eZ, 

V(t(y), oo) = lim Taak(<p M (t(y),t(z))) - lim xajik{<p M {t{y - l),t(z))). 

(iv) 

T>{— oo, oo) = lim lim ra.iok(ip M {t(y), t(z))). 

y—^—OD z—>oo 

Proof. The proof is straightforward; we omit it. □ 
For (y,z) G 1? + we have 

^p t (M)(y, z) = rank(v9p t(M )(y, z-1))- rank(^ Pt(M) (?/, z)) 

- rank(v9 Pt(M) (y -l,z-l)) + rank(<p Pt(M) (y - 1, z)) 
= Taak((p M (t(y),t(z - 1))) - Tank(<p M (t(y),t(z))) 

- miak(<pM(t(y - l),t(z - 1))) + T&Dk((p M (t(y - l),t(z))) 
= V(t(y),t(z)), 

where the first equality follows from Lemma r2.2.3( i). and the last equality follows 
from Lemma l2.3.5( i). 

Thus we have Pp t (Af)(j/, z) = V{i(y),i(z)) for all (y,z) G Z+. Similar arguments 
using Lemma l2.2.3( ii)-(iv) and Lemma l2.3.5( ii)-(iv) in the cases where y = — oo or 
z = oo show that in fact this holds for (y,z) G This proves the claim. 

It follows easily from the fact that M is well behaved that A is equal to the set of 
critical values of M. There thus exists a sequence of grid functions {tj}j e N such that 
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t{ is a 1/2* cover and A C im(tj) for each i. The lemma follows by writing T>m as the 
limit of the persistence diagrams P(p t .(Af),ti)- D 

Theorem 2.3.6. Let M be a well behaved Bi-persistence module. Let T>m be the 

persistence diagram of M. Then 

M = ® { a, b)& u PP (v M )C(a,b) v ^ a > b \ 

This decomposition of M is unique in the sense that if T> is another persistence dia- 
gram such that M = ®( a ,6)esupp(X))C '(a, b) T><ya ' b \ then V = T>m- 

Proof. By Lemma I2.3.4[ it's enough show that there exists some persistence diagram 
V such that M = ® {a ,b) & u PP (v)C(a,b) v ^ b \ 

By Proposition 12. 3. 2\ there exists a grid function t and a tame Aj-persistence 
module M such that E t (M.) = M. The structure theorem for tame Ai-persistence 
modules gives us that there's a persistence diagram £>m supported in 1? + such that 
we may take 

M = ® {a , b) esu PP (v M )C(a,b) v ^ b \ 

We'll show that 

M = ® {a , b)esupp(VM) E t (C(a,b)f^ a ' b \ 

We have that E t (C(a,b)) = C(i(a),i(b)) for any (a, b) e Z^_, so this gives the result. 

To show that M = ®^ b ) £suppiVw[ )Et(C(a,b)) v ^ a ' b \ we'll use the category theo- 
retic characterization of direct sums of modules as coproducts [33]. Let A be a set. 
Recall that in an arbitrary category, an object X is a coproduct of objects {X a } ae A 
iff there exist morphisms {i a : X a — > X} a€ A, called canonical injections, with the 
following universal property: for any object Y and morphisms {f a : X a Y} a& A, 
there exists a unique morphism / : X — y Y such that / o i a — f a for each a G A. 
In a category of modules over a ring R, The coproduct of modules X a is (B a X a ; the 
canonical injections are just the usual inclusions X a @ a X a . The same is thus true 
for the module subcategories A„-mod and -B n -mod. 

Now let {M a } denote the indecomposable summands of M in the direct sum 
decomposition M = ©( a ,b)esupp(D M )C( a > b) T>m ^ a ' b \ so that each M a = C(a, b) for some 
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(a, b) G 7j 2 + . Let {i a : M a — > M} denote the canonical injections. 

We'll show that the maps E t (i a ) : E t (Wl a ) — > E t (M) satisfy the universal property 
of a coproduct, so that M = E t (M) = ® a E t (M a ) as desired. 

To show that the maps E t (i a ) : E t (Wl a ) — > E t (M) satisfy the universal property 
of a coproduct, let Y be an arbitrary ^-persistence module and {f a : E t (M. a ) — > Y} 
be homomorphisms. 

For any z G Z, M z = © a M". It follows from the definition of E t that for any 
r G R, 

S t (M) r S © Q ^(M Q ) r 

with the maps E t (i a ) r the canonical inclusions. 

For each r G R, define f r : E t (M) r — >• F r as © a /° (i.e. / r is the map guaranteed 
to exist by the universal property of direct sums for vector spaces.) It's easy to check 
that the maps f r commute with the transition maps in E t (M.) and Y, so that they 
define a morphism / : E t (Wl) — > Y. We also have that / o E t (i a ) = f a for each a. 
By the universal property of direct sums of vector spaces, for each r f r is the unique 
linear transformation from E t (M.) r to Y r such that for each a, f r o E t (i a ) r = 
Therefore / must itself satisfy the desired uniqueness property. This completes the 
proof. □ 

2.4 The Equality of the Interleaving and Bottle- 
neck distances on Tame Bi-persistence Mod- 
ules 

We show in this section that the restriction of the interleaving distance to tame 
-Bi-persistence modules is equal to the bottleneck distance. This shows that the 
interleaving distance is in fact a generalization of the bottleneck distance, as we 
want. The result is also instrumental in proving Corollary I2.5.3[ our converse to the 
algebraic stability result of [8]. 
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The Algebraic Stability of Persistence 

The main result of [5], generalizing considerably the earlier result of [H], is the 
following: 

Theorem 2.4.1 (Algebraic Stability of Persistence). Let e > 0, and let M and N be 

two tame Bi-persistence modules. If M and N are e-interleaved, then dB{M,N) < e. 

A Converse to the Algebraic Stability of Persistence? 

In the conclusion of [8], the authors ask whether it's true that if M and iV are 
tame -Bi-persistence modules with ds(M,N) = e then M and iV are e-interleaved. 
Example 12.1.21 shows that this is not true. However, Corollary 12.5.31 below, which 
follows immediately from Theorems 12.4.21 and 12.5. 1[ asserts that the result is true 
provided M and iV are finitely presented. More generally, Corollary 12.5.31 tells us 
that if M and N are tame modules with ds(M, N) = e then M and iV are (e + 5)- 
interleaved for any 5 > 0. In other words, the converse of Theorem 12.4.11 holds for 
tame modules to arbitrarily small error. 

Theorem 2.4.2. ds{M, N) = di(M, N) for any tame Bi-persistence modules M and 
N. 

Proof. Theorem 12.4.11 tells us that ds{M,N) < di(M,N), so we just need to show 
that ds{M^N) > dj(M,N). It will follow from the structure theorem for well be- 
haved persistence modules (Theorem I2.3.6j) that o?b(M' , N') > di(M',N') for well 
behaved persistence modules M' and N' (Lemma 12.4.41 below) . To extend this result 
to arbitrary tame modules, we will approximate the modules M and iV up to arbi- 
trarily small error in the interleaving distance by well behaved persistence modules 
(Lemma 12.4.51 below) . The full result will follow readily from this this approximation. 

Lemma 2.4.3. If (a,b),(a',b') £ R 2 + with \\(a,b) - (a',6')lloo < e> then C(a,b) and 
C(a',b') are e-interleaved. 



Proof. This is easy to prove; we leave the details to the reader. 



□ 
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Lemma 2.4.4. If M and N are two well behaved persistence modules and ds{M, N) = 
e then di{M,N) = e. 

Proof. By stability we just need to show that dj(M, N) < e. By the structure theorem 
for well behaved persistence modules (Theorem I2.2.2p . we have that 

N = ® {a , b)esupp{ v N )C(a,b) v ^ b \ 

Since c?b(M', N r ) = e, for any 5 > there exist persistence diagrams D' M and D' N 
with D' M < Dm, D' n < Dn, and a multibijection 7 between D' M and D' N such that 

1. For any (a, b) G supp(Z)jvf — D' M ) U supp(Z^Ar — D' N ), {b — o)/2 < e + 5, 

2. For any (2, y) G supp(7), \\x — y||oo < e + 5. 

Fix such -D^f , D' N , and 7. Now we can choose well behaved modules M', M" C M 
and N', N" C iV such that M = M' © M", N = N' ® N", D M , = D' M , D N , = D' N , 
Dm" = D M - D' M , and D N „ = D N - D' N . 

If follows from Lemma 12.4.31 that for each (a,b), (a' ,b') G supp(7), C(a,b) and 
C(a', b') are (e + <5)-interleaved. We may write 

M' ©( a , ^) ),(a^ fe o e su P p( 7 )C(a,6)^^'( a '' 6, )), 

It's clear from the form of these decompositions for M' and N' that a choice of a 
pair of (e + <5)-interleaving homomorphisms between C(a,b) and C(a',b') for each 
pair (a,b), (a' ,b') G supp(7) induces a pair of (e + <5)-interleaving homomorphisms 
/ : M' ->■ iV'(e + 5) and g : N' M'(e + 5). 

Now we extend this pair to a pair of homomorphisms / : M — > N(e + 8), g : N — >■ 
M(e + 5) by defining /(y) = f(y) for y G M', f{M") = 0, ^(y) = g(y) for y G N', 
and g(M") = 0. Obviously, / and g restrict to (e + <5)-interleaving homomorphisms 
between M' and N'. Moreover, we have that S(M", 2e + 5) = and S(N", 2e + 5) = 0, 
so / and g restrict to (e + <5)-interleaving homomorphisms between M" and N" . Thus 



CHAPTER 2. INTERLEAVINGS ON MULTI-D PERSISTENCE MODULES 49 



by linearity, / and g are (e + ^-interleaving homomorphisms between M and N. Since 
S may be taken to be arbitrarily small, we must have di(M, N) < e, as we wanted to 



Lemma 2.4.5. For any tame Bi-persistence module M and 5 > 0, there exists a well 
behaved persistence module M' with dj(M,M') < 5. 

Proof. Let t be an 5/ 2-cover of R, as defined in Section 12.2.31 For any r£R, there 
exists r' G im(t), with < r' — r < 5. Define a function A : R — > im(t) such that 
A(r) = min{r' > r\r' G im(t)}. Then < A(r) — r < 5 for all r G R. 

Let M = P t (M) (as defined in Section and let M' = E t (M). Then M' is 

well-behaved. We now show that M and M' are 5-interleaved, which implies that 
dj{M, M') < 5. 

Define / : M — >■ M'(S) to be the morphism for which 

f u :M u ^ M' u+S = <p M ,(\(u),u + 5)o J M \ x(u) o l M \ t - 1(x{u)) o ^ m (m, A(u)). 

Define g : M' — >• M(5) to be the morphism for which 

g u :M' u ^r M u+S = tp M (\(u),u + 5) o X MAt -i (A(u)) o JTm^ah ° ^'(w, A(m)). 

We need to check that / and g thus defined are in fact morphisms. We verify this 
for /; the verification for g is similar; we omit it. 
If u < v G R, we have 

f v o(p M (u,v) = (p M '(X(v),v + S) oJ^ tx{v) oI^ t _ 1Wt)) o<f M (v,X(v)) o(p M (u,v). 



show. 



□ 



(2.1) 



By definition, for any u < v G R, we have 



<PM'(u,v) = J M \ 



o ip M (t 1 (u),t 1 (v)) o J Mt> 



M,t,v 



oT 



(„) ° Vm(u, v) o l M ,t,t-Hu) ° Jm,!,«. 



(2.2) 
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Using (12. 2p to substitute for ip M i(X(v ), v + 5) in (12. ip and simplifying gives us: 
fv o (p M (u, v) = J M \ V+S o 2^ t)t _ 1(w+i) o (p M (X(v), v + 5)o (p M (v, X(v)) o (p M (u, v) 

= ^M,t,v+S ° X M,t,t-i(v+5) ° VM(U, V + 5). 

On the other hand we have, using (12.21) again, 
ip M '(u + S,v + 5) o f u 

= ifu'iu + 5,v + 5)o <f M ,(\(u),u + 5) o J M ] tMu) ol M ] t ^ 1(x{u)) o (p M ( U} X(u)) 

= (p M ,(\(u),V + 5)o J M \ X{U) o X M ^_ 1(A(u)) o (p M ( U , X(u)) 

= Jm\v+s ° X M,t,t-^v+S) ° <Pm(Mu), v + 5)o I M , t ,t-i(\(u)) 

o Jm,t,X{u) ° Jm,t,\{u) ° X M,t,t-^\(u)) ° <Pm(u, X{u)) 
= Jm,t,v+s ° x llt,t^{v+s) ° <Pm(\(u), v + 5)o m (u, X(u)) 
= Jm,t,v+5 ° x llt,t-^v+s) ° <Pm(% v + 5). 

Thus /„ o <pm{u, v) = <pm'(u + 5,v + 5) o f Uj as we wanted to show. 

Finally, we need to check that gof — S(M, 25) and fog = S(M', 25). We perform 
the first verification and omit the second, since the verifications are similar. 

For u e R, 

9u+5 ° fu 

= Wm{K u + 5),u + 25) o l M ,t,t-i(\(u+s)) ° Jm,t,\{u+s) 
o <p M ,( u + 5, X(u + 5)) o <p M ,(X(u),u + 5) o J^txu) oX i,t-i(A(«)) ° ¥to( u > 

= Lf M (X(u + 5),U + 25) O l M ,t,t-i(\(u+5)) ° i7M,t,A(«+5) 

O ^(A(U), A(u + 5)) O JM,tXu)° X M,t,t-HX(u)) ° ^(«, A(u)). 
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Using (I2.2p once again, we have that this last expression is equal to 



(f M (A(W + 5), U + 25) O Z M ,t,t-i(\(u+6)) ° JM,t,X{u+5) ° <?M,t,\(u+d) 



ol 



-1 



'M,t,t- 1 (\(u+S)) 



o (p M (X(u), X(u + 5)) oX A , w -i (A(u)) o J Mjti x( u ) ° Jm,t,\{u) oT i 
= (p M {X{u + S),u + 25) o(p M (X(u),X(u + S)) ocp M (u,X(u)) 
= pm(u, u + 28). 



O Lf M (u, X(u)) 



□ 



Now we can complete the proof of Theorem 12.4.21 As mentioned above, by The- 
orem EXH it suffices to show di(M, N) < d B (M,N). Say d B (M,N) = e. Choose 
5 > 0. By Lemma l2~4~5l there exist well behaved modules M', N' with <i/(M, M') < 5, 
dj(N,N') < 5. Then by Theorem EXIl d B (M,M') < 5, d B (N,N') < 5, so by the 
triangle inequality, d B (M',N') < e + 25. By Lemma di{M',N') < e + 25. 

Applying the triangle inequality again, we get that di(M, N) < e + 45. As 5 may be 
taken to be arbitrarily small, we have di(M, N) < e, which completes the proof. □ 



2.5 If d 7 (M, N)=e and M, N are Finitely Presented 
then M and are e-interleaved 



We now show that for finitely presented 5 n -modules M and N, if dj(M, N) = e then 
M and iV are e-interleaved. This implies that the restriction of dj to finitely presented 
persistence modules is a metric and, as noted in Section \2A\ yields a converse to 
the algebraic stability of persistence for finitely presented .Si-persistence modules. 
Theorem 12.5.11 will also be of some use to us in Section I2.11[ 

Theorem 2.5.1. If M and N are finitely presented B n -modules and di(M,N) = e 
then M and N are e-interleaved. 

Corollary 2.5.2. di is a metric on finitely presented B n -modules. 

Corollary 2.5.3 (Converse to Algebraic Stability). 

(i) If M and N are finitely presented B\-persistence modules and d B (M,N) = e 
then M and N are e-interleaved. 
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(ii) If M and N are tame Bi-persistence modules and ds{M,N) = e then M and 
N are (e + 5) -interleaved for any 5 > 0. 

Proof, (ii) follows directly from Theorem 12.4.21 (i) is immediate from that theorem 
and Theorem 12.5.11 □ 

For a finitely presented S n -persistence module M, let Um C M™ be the set of 
grades of the generators and relations in a minimal presentation for M. Let U M C R 
be the set of i th coordinates of the elements of Um- 

Proof of Theorem \2.5.1[ 

Lemma 2.5.4. If M is a finitely presented B n -persistence module then for any a < 
b G M. n such that (a*, &i] H U M = /or a// i, <Pm(ci, b) is an isomorphism. 

Proof. This is straightforward; we omit the details. □ 

Lemma 2.5.5. If M is a finitely presented B n -persistence module then for any y G 
MJ 1 , there exists r G M. >0 such that ipM{y,y + r is an isomorphism for all < r' < r. 

Proof. This is an immediate consequence of Lemma 12.5.41 □ 

For a finitely presented -B n -persistence module M, let }Im '■ ^ n — > ^i=iU M be 
defined by // M (ai, a„) = (a' 1; a' n ), where a[ is the largest element of U M such 
that a'i < aj, if such an element exists, and = — oo otherwise. 

Lemma 2.5.6. For any finitely presented B n -module M and any y G K n wit/i 
fltdiy) G R n ; we /iaue t/iat ipM(fhi(y),y) is an isomorphism. 

Proof. This too is an immediate consequence of Lemma 12.5.41 □ 

Having stated these preliminary results, we proceed with the proof of Theo- 
rem [53IU By Lemma 12.5.51 and the finiteness of Um and Un, there exists 5 > 
such that for all z G Um, <Pn(z + e, z + e + 5) and ipm{z + 2e, z + 2e + 25) are iso- 
morphisms, and for all z G Un, (Pm( z + e, z + e + 5) and v^at^ + 2e, z + 2e + 25) are 
isomorphisms. 

By Remark I2.1.1[ since di(M, N) = e, M and iV are (e + <5)-interleaved. 
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Theorem 12.5.11 then follows from the following lemma, which will also be the key 
ingredient in the proof of Proposition 12.11.91 

Lemma 2.5.7. Let M and N be finitely presented B n -persistence modules and let 
e > and 5 > be such that 

1. M and N are e + 5 '-interleaved, 

2. for all z G Um, ^N^z + e, z + e + 5) and <fM(z + 2e, z + 2e+28) are isomorphisms, 

3. for all z G Un, <Pm(z + c, z + e + 5) and ipN(z + 2e, z + 2e + 25) are isomorphisms. 

Then M and N are e-interleaved. 

Proof. Let / : M — > N(e + 5) and g : iV — >■ M(e + 5) be interleaving homomorphisms. 

We define e-interleaving homomorphisms / : M — > N(e) and g : N — >• M(e) via 
their action on homogeneous summands. First, for z G Um define f z = <p~^~(z + 
e, z + e + 5) o f z . Then for arbitrary z G M. n such that JIm{z) G W 1 define f z = 
VnUIm{z) + e,z + e) o f f i M ( z) o ^{fl M {z),z). (Note that Lp^(fl M (z), z) is well 
defined by Lemma EEB) Finally, for z G E n s.t. fl M (z) £ M n , define f z = 0. (If 
flhi{z) G" W 1 then M z = 0, so this last part of the definition is reasonable.) 

Symmetrically, for z G Un define g z = (p]^(z+e, z+e+5)og z . For arbitrary zGl™ 
such that fl N (z) el" define & = (p M (fl N (z) + e, z + e) o g f i N{z) o (p~ l (fl N (z), z). For 
z G R n s.t. fl N (z) £ R n , define g z = 0. 

We need to check that /, g as thus defined are in fact morphisms. We perform 
the check for /; the check for g is the same. 

If y G lR n is such that fhi{y) & R n , then since M y = 0, it's clear that f z o 
Vm{v, z) = ip N (y + e, z + e) o f y . 
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For y < z G R n such that fl M (y) G R n , 

L o (p M (y, z) = (p N (fl M (z) + e, z + e) o f f i M ( z ) ° VmU 1 m{.z),z) o tp M (y, z) 

= VnU 1 m(z) + e,z + e)o ip N 1 (fl M (z) + e, fl M (z) + e + 5) o //* M ( Z ) 

°VmU 1 m{z),z) oip M (y,z) 
= VnU 1 m(z) + e,z + e)o ip N 1 (fl M (z) + e, fl M (z) +e + 5)o f flm{z) 

° <PM(flM(y), flM(z)) o (fM(fl M (y),y) 

= VnUIm{z) + e,z + e)o ip N 1 (fl M (z) + e, /Z M (^) + e + 5) 

o VN{flM{y) + e + 5, fl M (z) + e + 5) o f fhl{y) o ipM(fl M (y),y) 
= (p N (y + e,z + e)o (p N (fl M (y) + e,y + e) 

° ¥ N l U l M{y) + e, fl M (y) + e + 5) o / /jM(y) o <PM(fl M (y),y) 
= ip N (y + e,z + e)of y 

as desired. 

To finish the proof, we need to check that g o / = S(M, 2e) and fog — S(N, 2e). 
We perform the first check; the second check is the same. 

For ^ G M n , if /7 M (V) £ K n then since M 2 = 0, g z+e o f z = = y9 M (^ z + 2e). 

To show that the result also holds for z such that JIm{z) G IR n , we'll begin by 
verifying the result for z G Um- We'll use this special case in proving the result for 
arbitrary z G lR n such that fl M (z) G R n . 

If z G C/m then, by assumption, <pm(z + 2e, z + 2e + 25) is an isomorphism. Thus, 
to show that g z+t o f z — <^m(z, z + 2e), it suffices to show that <Pm(z + 2e, z + 2e + 
25) o (? z+e o f z = ip M (z, z + 2e + 25). 
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For z e U M , we have 

° fz = ¥M(flN{z + e) + e, z + 2e) o g f i N ( z+e ) o <pjf(fl N (z + e),z + e) o f z 
= VmUIn{z + e) + e, z + 2e) o y~^{fl N {z + e) + e, /Zat(z + e) + e + 5) 

° 9fi N {z+e) o ^(flNiz + e),z + e)o f z 
= <PM(flN(z + e) + e, z + 2e) o y~^(fl N (z + e) + e, /^(z + e) + e + 5) 
° 9fi N (z+e) o (pjfiflNiz + e),z + e) o <prf(z + e,z + e + 5) o f z . 

Thus 

V?m (z + 2e, z + 2e + 25) o o f z 
= (f M (z + 2e, z + 2e + 5) o (p M (flN(z + e) + e, z + 2e) 

° <Pm(/In(z + e) + e, /Zjv(z + e) + e + 5) o g f i N ( z+e ) 

° ^UIn{z + e), z + e) o ^(z + e, z + e + 6) o f z 
= ip M {z + 2e + 5, 2 + 2e + 25) o <p M (fl N (z + e) + e + 5, z + 2e + 5) 

° 9fi N (z+e) ° ^{fl N {z + e), z + e) o ip-\z + e, z + e + 5) o f z 
= ip M (z + 2e + 5,z + 2e + 25)o g z+t o ip N (fl N (z + e), z + e) 

° ^(flNiz + e),z + e)o<pJ r 1 (z + e,z + e + 6)o f z 
= 9z+e+6 ° y?Ar(z + e, z + e + 5) o + e, 2 + e + <*) o / a 

= </?(z, z + 2e + 25) 
as desired. 

Finally, for arbitrary z G R n such that /Im(z) ^ 1", we have, using that g is a 
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morphism, 

9z+e ° fz = WmU 1 n(z + e) + e, z + 2e) o g f i N{z+e) 

° V~nU 1 n(z + e), z + e) o ip N (fl M (z) +e,z + e)o f fhl{z) o ip^{fl M {z), z) 
= g z+e o cp N (fl N (z + e),z + e)o ip N 1 (fl N (z + e), z + e) 

° VnU 1 m{z) + e,z + e)o ff h/[(z) o ^(fl M (z), z) 
= §z+e o VnU 1 m{z) + e,z + e)o f fhl{z) o ip]^(fl M (z), z) 
= <PM(flM(z) + 2e,z + 2e) o g f i M ( z)+t o ffi M ( z ) o ip^(fl M (z), z) 
= VmUIm{z) + 2e,z + 2e) o f M (fhi(z), fhi(z) + 2e) o cp^(fl M (z), z) 
= (p M (z,z + 2e) 

as we wanted. □ 

This completes the proof of Theorem 12.5.11 □ 

Remark 2.5.8. As noted in Remark 12.3.31 the notion of a well behaved persis- 
tence module admits a generalization to the mult i- dimensional setting. An interest- 
ing question is whether Theorem 12.5.11 generalizes to well behaved multidimensional 
persistence modules; if it does, then we obtain corresponding generalizations of Corol- 
laries 12.5.21 and 12.5.31 Our proof of Theorem 12.5.11 does not generalize directly. 

2.6 An Extrinsic Characterization of Interleaved 
Pairs of Multidimensonal Persistence Modules 

In this section, we introduce (Ji, J 2 )-i n terleavings of pairs of _B„-persistence modules, 
a generalization of e-interleavings of pairs of _B n -persistence modules. These general- 
ized interleavings serve as a convenient language for expressing nuanced relationships 
between ^-persistence modules which arise in our study of topological inference 
using Rips multifiltrations in Section 14.4.21 

After defining generalized interleavings, we present Theorem 12.6.41 our "extrinsic" 
characterization of ( J\, ^-interleaved pairs of persistence modules; as noted in the 
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introduction, this result expresses transparently the sense in which ( J 1? J 2 )-interleaved 
persistence modules are algebraically similar. In particular, Theorem 12.6.41 gives an 
algebraically transparent characterization of e-interleaved pairs of modules, which 
we write down as Corollary 12.6.51 This characterization induces in an obvious way 
a corresponding characterization of the interleaving distance. It is also the most 
important step in our proof of our main optimality result Corollary 12.10.21 

2.6.1 ( J\, J2)-Interleavings 

In Section 12.1.41 we introduced shift functors and transition morphisms on B n - 
persistence modules. To prepare for the definition of (J±, J2)-interleavings, we define 
generalized shift functors and generalized transition morphisms. 

Generalized Shift Functors 

We say that a bijection J : IR n — >■ M. n is order-preserving if V a, b G IR n , a < b iff 
J(a) < J(b). For any order-preserving map J : R™ — > M n , we define the generalized 
shift functor {-){J) '■ B n -mod— > B n -mod, as follows: 

1. Action of {-){J) on objects: For M G obj(-B n -mod), we define M(J) by taking 
M(J) a = Mj( a ) for all a G M™. For all a < b G 1R™ we take the transition map 
V 9 a/(j)(o, b) to be the map <p M (J(a), J(b)). 

2. Action of {-){J) on morphisms: For M, N G obj(-B n -mod) and / G hom(M, N), 
we define f{J) '■ M(J) — > N(J) to be the homomorphism for which f{J) a — 
f J[a) for all a G K™. 

For u G M n , let J u : R™ — > W 1 be the map defined by J u (a) = a + u, and for eel, 
let J t denote J? n . is equal to the shift functor (•)(«) introduced in Section [2. 1.41 

and (-)(</e) is equal to the shift functor (-)(e) introduced in the same section. 

As in the case of ordinary shift functors, for a morphism / and J an order- 
preserving map, we will sometimes abuse notation and write f(J) simply as /. 
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Note the following contravariance property of generalized shift functors: If J\ and 
J 2 are order-preserving, then 



Generalized Transition Homomorphisms of I? ra -persistence Modules 

Say an order-preserving map J : M. n — > M. n is increasing if J(a) > a for all a G R n . 
For a -B n -persistence module M and J an increasing map, let S(M, J) : M — > M(J), 
the J-transition homomorphism, be the homomorphism whose restriction to M a is 
the linear map <p M (a, J (a)) for all a G W 1 . Note that for e G M> , S(M, e) = 5(M, J e ), 
where S(M,e) is as defined in Section [2. 1.41 

The following lemma gives three easy and useful identities for generalized transi- 
tion morphisms. Of particular note is Lemma [2.6. l( i). 

Lemma 2.6.1. 

(%) For any f : M — >■ N G hom(5 ra -mod y ) and any J : M n — > M n increasing, 



(•)(^oJ 1 ) = (-)(Ji)o(-)(J 2 ). 



5(iV, J)of = f(J)oS(M, J). 



(ii) For any J, J' increasing and B n -persistence module M , 



S(M, J'){J) o 5(M, J) = 5(M, J' o 



J). 



(zmJ For any J, J' increasing and Bn-persistence module M, 



S(M, J')(J) = S(M(J), J' 1 ofo 



J). 



Proof. We'll prove (iii) and leave the proofs of (i) and (ii) to the reader. For any 
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S(M, J') J(o ) 

(Pm(J(o), J ° J 1 o J' o J (a)) 

s(M(j), r l o j'o j) a . 



□ 



(Ji, </ 2 )-Interleavings of ^-Persistence Modules 

For Ji, J2 : M. n — > M n increasing, we say an ordered pair of ^-persistence modules 
(M,N) is (Ji, ^-interleaved if there exist homomorphisms / : M — > N(J\) and 
g : N M(J 2 ) such that 



we say that (/, g) is a pair of (Ji, J 2 )-interleaving homomorphisms for (M,N). 

Note the asymmetry of ( Ji, J 2 )-interleavings. If (M,N) is (Ji, J2)-interleaved, it 
needn't be true that (M,N) is (J 2 , Ji)-interleaved. It is however true that (N,M) is 
(J2, Ji)-interleaved. 

Remark 2.6.2. A note on terminology: When a pair (M, AT) of -B n -persistence mod- 
ules is ( Ji, J 2 )-interleaved, we will often say simply that M, A 7 " are ( J±, J 2 )-interleaved, 
or that M and A" are (Ji, J 2 )-interleaved. Similarly, when (/, g) is a pair of (Ji, J 2 )- 
interleaving homomorphisms for (M,N), we will often say simply that /, c/ are 
( Ji, J 2 )-interleaving homomorphisms for M, N. 

Note that M and A" are e-interleaved if and only if the pair (M, N) is ( J e , J e )- 
interleaved. 

2.6.2 A Characterization of (Ji, ^-interleaved Pairs of Mod- 
ules 

We now present our characterization of pairs of ( J\, J 2 ) -interleaved _B n -persistence 
modules. 



g^) of = S(M, J 2 o J x ) and 
f(J 2 )og = S(N,J 1 oJ 2 ) ] 
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Notation 

To state the theorem we need some notation. 

If G — (G, ia) an n-graded set, and J : W l — > W 1 is an order-preserving map, let 
G(J) denote the n-graded set (G,l' g ), where t' G (y) = J _1 (i(w)). Note that if J 1 and 
J 2 are order-preserving, then G(J 2 o J x ) = (G( J 2 ))( Ji). 

Similarly, if M is a £? n -persistence module and Q C M is a homogeneous subset, 
let Q(J) C M(J) denote the image of Q under the bijection between M and M(J) 
induced by the identification of each summand M(J) a with Mj( ). Note that if J\ 
and J 2 are order-preserving, then Q( J 2 o J t ) = J 2 ))( Ji). 

Remark 2.6.3. For any n-graded set G and J : IR n — >■ IR n an increasing map, 
the homomorphism S({G( J^ 1 )}, J) ■ (G(J -1 )} — > (G) is injective, and so gives an 
identification of (G(J -1 )) with a submodule of (G). More generally, if G 1 and G 2 are 
n-graded sets, we obtain in the obvious way an identification of (Gi, G 2 ( J -1 )) with 
a submodule of {G\,G 2 )- In particular, for any e > 0, we obtain an identification 
of (G(—e)) with a submodule of (G) and, more generally, of (Gi,G 2 (— e)) with a 
submodule of (Gi,G 2 ). 

Theorem 2.6.4. Lei M and N be B n -persistence modules. For any Ji, J 2 : IR n — > 

W" 1 increasing, (M,N) is (J ± , J 2 ) -interleaved if and only if there exist n-graded sets 
Wi,W 2 and homogeneous sets y^y 2 C (Wi,W 2 ) such that y 1 G (Wi, W 2 ( J^T 1 )}, 
^ 2 e (Wi^r 1 ),^), and 

M=(W 1 ,W 2 (J 2 - 1 )|^i,y 2 (^ 1 )), 

7/M and N are finitely presented, then Wi, W 2 , 3^i, 3^2 can &e to£;en to be finite. 

The following corollary is immediate. 

Corollary 2.6.5. Let M and N be B n -persistence modules. For any e G K>o, M and 
N are e-interleaved if and only if there exist n-graded sets Wi, W 2 and homogeneous 
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sets y x ,y 2 c (Wi, W 2 > such that y x e (Wi,W 2 {-e)), y 2 e (Wi(-e), W 2 ) ; 

M-(W 1 ,W 2 (-e)|y 1 ,y 2 (-e)) 
iV= (Wi(-e),W 2 |yi(-e),y 2 ). 

7/M and iV are finitely presented, then Wi, W 2 , 3^2 can 6e taken to be finite. 



Proof of Theorem \2.6.4\ It's easy to see that if there exist n-graded sets Wi, W 2 and 



sets 3^1,3^2 C (Wi,W 2 ) as in the statement of the theorem then M,N are (Ji, J 2 )- 
interleaved. 

To prove the converse, we will first express M and N as an isomorphic copies of 
two quotient persistent modules which are, in a suitable sense, algebraically similar. 
We will then construct the desired presentations by lifting the structure of these 
quotients to free covers. 

The construction by which we express M and N as quotients is a generalization to 
( Ji, J 2 )-interleaved pairs of -B„-persistence modules of one introduced for e-interleaved 
pairs of I?i-persistence modules in the proof of [H Lemma 4.6]. 

Let /, g be (J\, J 2 )-interleaving homomorphisms for M, N. 

Lemma 2.6.6. Let^ : M(J^oJ 2 l ) -> M®N(J 2 l ) be given by ^(y) = (S(M(Jr l o 
J 2 X ),h ° Ji){y),-f{y)). Let 72 : N^ 1 ) -> M © N(J 2 l ) be given by l2 {y) = 
(—g(y),y). Let R C M © N(J 2 l ) be the submodule generated by im(7i) U im(7 2 ). 
Then 

M = (M © N(J 2 1 ))/R. 

Proof. Let i : M -»■ M © iV^" 1 ) denote the inclusion, and let ( : M © iVf^ 1 ) -+ 
M © N (J^ 1 ) I R denote the quotient. We'll show that £o6 is an isomorphism. For any 

(vm,Vn) e M © iVlJa" 1 ), (-g(y N ),yN) e ^(72) c i2, so C ° = (0,^) + A 

Therefore £ ^(un) + 2/m) = (z/a/, 2/jv) + -R- Hence £ t is surjective. 

£ o 1 is injective iff i(M) PI R = 0. It's clear that t(M) D im(7 2 ) = 0. Thus to show 
that C o l is injective it's enough to show that im(7i) C im(7 2 ). If w G M(J x l o J 2 X ), 
then since S{M{J7 l oJ^), J 2 o J x )(y) = </o/(y), (5(M(Jf 1 o^" 1 ), J 2 o J^y), -/(y)) = 
/(y), -/(y)) = 72(-/(y))- Thus im(7i) C im(7 2 ) and so C ° t is injective. 
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Thus £ o l is an isomorphism. □ 

Now let (Gm\Rm) be a presentation for M and let (Gn\Rn) be a presentation for 
N. Without loss of generality we may assume M = (Gm) / (Rm) and N = (Gn) / (Rn)- 
Let pm '■ (Gm) M, Pn : (Gjv) — > N denote the quotient maps. Then ((Gm), Pm) 
and ((G^) i Pn) are free covers for M and N. 

Let / : (G M ) -> (G N (Ji)) be a lift of / and let g : (G^) -> (G M (J 2 )) be a lift of 

9- 

Let P M ,iV = {y - /(y)}ygG M (Jr 1 ) and let Rn ' m = { y ~ 9(y)} y eG N (J^)- Note that 
Rm,n is a homogeneous subset of (Gm(Ji X ), Gjy) and Pjv,Af is a homogeneous subset 
of {G M ,G N (J^)). 

Let 

Pm = (Gm,Gn(J 2 1 )\Rm,Rn(J 2 ),Rm,n{J 2 1 ),Rn,m), 
Pn = (Gm(J\ ), G\w|-Rm( )iRniRm,NiRn,m(J\ 

RmM^ 1 ) lies in (GmC^oJs -1 ), G w ( J^ 1 )). By RemarkEOSIJ the 

map S(Gm(J\ ° 

J^ 1 ), J2 o Jx) identifies (Gm^i 1 ^T 1 )) with a subset of (Gm)- Thus Pa/ is well de- 
fined. By an analogous observation, Pn is also well defined. 

We claim that Pm is a presentation for M and P/v and is a presentation for N. 
We'll prove that P^ is a presentation for M; The proof that Pjy is a presentation for 
iV is identical. 

Let 

P = (Cm, Gn(J 2 )), 

•K" = (Pm, Rn(J 2 1 ), Rm,n(J 2 ), Rn,m) 

K'=(R M ,R N (J 2 1 ))- 

Let p : P — )■ F/K' denote the quotient map. Clearly, we may identify F/K' with 
M© ^(J^ 1 ) and p with (pm,Pn)- We'll check that p maps (Pm,v(^ 1 )) surjectively 
to im(7i) and (Rn,m) surjectively to im(7 2 ), so that under the identification of F/K' 
with M®N(J^ 1 ), K/K' = R. Given this, it follows that P M is a presentation for M 
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by Lemma 12.6.61 and the third isomorphism theorem for modules [18] . 

We first check that (p(Rm,n(J2 ))) = i m (7i)- Viewing Rm^n^J-z 1 ) as a subset of 
(G M , G n {J^)), RmAJ^ 1 ) = {S((G M (J^ ° J* 1 )), -h ° Jt)(y) - f(y)} yeGMi j r ^y 
SdGMiJi 1 o J^ 1 )), J 2 o J x ) is a lift of S(M(Ji 1 o J^ 1 ), J 2 o J : ) and / is a lift of /, 
so for any y e Gm^i 1 ° ^2 _1 )) 

p{S{{G M {J x l o J- 1 )), J 2 o J x )(y) - f(y)) 

= (5(M(Jf 1 o J" 1 ), J 2 o J x )(p M {yj), -f(p M (y))) 

= ii{pAi{y))- 

Thus p(Rm,n(J$ T 1 )) c i m (7i)- Since Gm generates (Gm) and pm is surjective, we 
have that pURm^^ 1 ))) = i m (7i)- 

The check that (p(Rn,m)) = i m (72) is similar to the above verification that 
(p^a/.jv^ 1 ))) = im (7i), but simpler. R NjM = {y - g(y)} veGlf ^y 9 is a lift 
of g so for any y G (jjv"(J 2 ~~ ), 

p(y - g{y)) = (-g(PN(y)),PN(y)) = 72(pjv(y))- 

Thus p(Rn,m) C. im(7 2 ). Since G^ generates (Gjy) and p^ is surjective, we have that 
p((Rn,m)) = im(7 2 ). 

This completes the verification that Pm is a presentation for M. 

Now, taking Wi = G M , W 2 = G N , y x = R M U Rn,m-> an d ^2 = -Rat U -Rm,at 
gives the first statement of Theorem 12.6.51 If M and N are finitely presented then 
Gm, Gn, Rmi Rn, Rm,n, and Rn,m can all be taken to be finite; the second statement 
of Theorem 12.6.51 follows. □ 

2.7 Geometric Preliminaries 



In this third section of preliminaries, we present preliminaries of a geometric and 
topological nature which we will need in the remainder of the thesis. 
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One organizational note before proceeding: We define multidimensional nitra- 
tions and multidimensional persistent homology here, in Section 12.7.31 In our study 
of topological inference in Chapter 4, we will need a slightly more general defini- 
tion of multidimensional nitrations and a correspondingly more general definition of 
multidimensional persistent homology. However, since these more general definitions 
are not needed in the remainder of this chapter, we will defer their introduction to 
Section 13.11 

2.7.1 CW-complexes and Cellular homology 

Our proof of the optimality of the interleaving distance in Section 12.101 will involve 
the construction of CW-complexes and the computation of their cellular homology. 
We now briefly review finite dimensional CW-complexes and cellular homology. 

Definition of a Finite-dimensional CW-complex 

A CW-complex is a topological space X together with some additional data of at- 
taching maps specifying how X is assembled as the union of open disks of various 
dimensions. We quote the procedural definition of a finite-dimensional CW-complex 
given in [29J. 

Let D 1 denote the unit disk in W; for a contained in some indexing set (which 
will often be implicit in our notation) let D l a be a copy of D l . When i is clear from 
context, we will sometimes denote D l a simply as D a . 

A finite-dimensional CW-complex is a space X constructed in the following way: 

1. Start with a discrete set X°, the 0-cells of X. 

2. Inductively, form the i-skeleton of X 1 from X % ~ x by attaching z-cells e % a via maps 
a a : S^ 1 — > X 1 ^ 1 . This means that X 1 is the quotient space of X l ~ 1 JI a D l a under 
the identifications x ~ a a (x) for x G SD l a . The cell e l a is the homeomorphic 
image of D l a — 5D l a under the quotient map. 

3. X = X r for some r. We call the smallest such r the dimension of X. 

The characteristic map of the cell e l Q is the map $ Q : D l a — > X which is the composition 
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D l a ■=->■ X 1 1 JI a D^, — > X % X, where the middle map is the quotient map defining 

X\ 

A subcomplex of a CW-complex X is a closed subspace A of X which is a union of 
the cells of X; those cells contained in A are taken to have the same attaching maps 
as they do in X. 

Cellular Homology 

We mention only what we need about cellular homology to prove our optimality result 
Theorem 12.10.11 For a more complete discussion and proofs of the results stated here, 
see e.g. [29] or [2]. 

For % e Z>o, we'll let Hi denote the i th singular homology functor with coefficients 
in the field k. 

For X a CW-complex and i e N, let df : H^X^X*- 1 ) -> F^ipC" 1 , X l ~ 2 ) 
denote the map induced by the boundary map in the long exact sequence of the pair 
(X t ,X l ~ 1 ). It can be checked that the df give 

■ ■ • %' HiiX^X'- 1 ) % ifi-ipf'- 1 , A*" 2 ) %' H (X°) -)• 

the structure of a chain complex, and that the i th homology vector space of this chain 
complex, denoted H? W (X), is isomorphic to H^X). 

It can be shown that a choice of generator for Hi(D l , S t ~ 1 ) = Z induces a choice 
of basis for Hi(X l , X^ 1 ) whose elements correspond bijectively to the z-cells of X. 
We now fix a choice of ^HAW)***,^ We can then think of 
Hi(X % , X 1 " 1 ) as the /c-vector space generated by the z-cells of X. 

It follows from the equality H$ W {X) = H Q (X) that in the case that X has a 
single 0-cell, df = 0. 

For i > 1, the cellular boundary formula gives an explicit expression for df . To 
prepare for the formula, we note first that for i 6 N, the choice of generator for 
Hi(D\ S 1 ' 1 ) induces a choice of generator aj for H^D 1 / 'S 1 ^ 1 ) via the quotient map 
D % — > D t /S l ~ 1 . Also, the choice of generator for Hi + i(D l+1 , S l ) induces a choice of 

1 Such a choice is induced e.g. by the standard orientation on D l . 
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generator bi for Hi(S l ) via the boundary map in the long exact sequence of the pair 
(D l+l , S l ). For each i EN, choose p % : D l / S 1 ^ 1 — > S l to be any homeomorphism such 
that pi : HiiDyS*- 1 ) -> H^S*) sends a 4 to h. 

For i e N and an z-cell of X, let (e^) c denote the compliment of in X*, and 
let qp : X* — > X*/(e^) c denote the quotient map. p l and $,3 induce an identification 
of g^X 4 ) with S\ 

By a compactness argument [2H1 Section A.l], for any i-cell e l a the image of the 
attaching map a a of e l a meets only finitely many cells. 
For i > 1, the cellular boundary formula states that 

E deg( 9/J o a^ej" 1 . 

im(<T a )Ue^"V0 

Here, for any map / : 5" _1 — >■ S 11-1 , deg(/) denotes the field element a E k such that 
/* : iJ i _i(S'^ 1 ) ->■ iJi_i(5' i " 1 ) is multiplication by a. 

We can endow the set of CW-complexes with the structure of a category by taking 
hom(X, Y) for CW-complexes X, Y to be the set of continuous maps / : X — > Y such 
that f{X l ) C Y % for all i. We call maps / E hom(X, Y) cellular maps. It can be 
shown that a cellular map / induces a map Hf w (f) : Hf w (X) —> Hf w (Y) in such 
a way that Hf becomes a functor. 

Further, there exists a natural isomorphism [33] ft : -fff 1 ^ — > Hi, where Hi is the 
restriction of Hi to the category of CW-complexes. 

2.7.2 Multidimensional Filtrations 

Fix n E N. 

Define an n-filtration X to be a collection of topological spaces {X a } ag Rn, to- 
gether with a collection of continuous maps {cf>x{d, b) : X a — > X b } a < bgK n such that if 
a < b < c E M n then <j)x{b,c) o <f) X {a,b) = (j)x(a,c). Given two n-filtrations X and 
Y, we define a morphism / from X to Y to be a collection of continuous functions 
{fa} a m n ■ X a ^-Y a such that for all a < b E W 1 , f b o (f) X (a, b) = <p Y (a, b) o /„. This 
definition of morphism gives the n-filtrations the structure of a category. Let n-fi.lt 
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denote this category. 

Define a cellular n-filtration to be a collection of CW-complexes {X a } a< z^n, 
together with inclusions of subcomplexes b) : X a — > X b } a < beIR n. Given two 

cellular n-filtrations X and Y, we define a morphism / from X to Y to be a collection 
of cellular maps {f a }aeR n '■ X a — >■ Y a such that for all a < b G M n , o 4> x {a,b) = 
</>y(a, b)o f a . This definition of morphism gives the cellular n-filtrations the structure 
of a category. 

Simplicial n-filtrations can be defined analogously. 

2.7.3 Multidimensional Persistent Homology 

The multidimensional persistent homology functor Hi is a generalization of the or- 
dinary homology functor with field coefficients to the setting where the source is an 
n-filtration and the target is a -B„-persistence module. We first present a definition 
of the singular multidimensional persistent homology functor; we introduce cellular 
multidimensional persistent homology below. 

Singular Multidimensional Persistent Homology 

For a topological space X and j G Z> , let Cj(X) denote the j th singular chain 
module of X, with coefficients in k. For X, Y topological spaces and / : X — > Y a 
continuous map, let /* : Cj(X) — > Cj(Y) denote the map induced by /. 

For X an n-filtration, define Cj(X), the j th singular chain module of X, as the 
-B n -persistence module for which Cj(X) u = Cj(X u ) for all u G M. n and for which 
(pCj(x)(u,v) = (f> x (u,v). Note that for any j G Z> , the collection of boundary maps 
{5j : Cj(X u ) — > Cj_i(X u )} ue ]Rn induces a boundary map Sj : Cj(X) — > Cj-i(X). 
These boundary maps give {Cj(X)}j>o the structure of a chain complex. We define 
the Hj(X), the j th persistent homology module of X, to be the j th homology module 
of this complex. For X and Y two n-filtrations, a morphism / G hom(X, Y) induces in 
the obvious way a morphism Hj(f) : Hj(X) — > Hj(Y), making Hj : n-filt— >■ -B„-mod 
a functor. 
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Cellular Multidimensional Persistent Homology 

A construction analogous to the one above, with cellular chain complexes used in 
place of singular chain complexes, yields a definition of the cellular multidimensional 
persistent homology of cellular n-filtrations. For a cellular filtration X, let C^ W {X) 
denote the j th cellular chain module of X, let 5f w : Cf w (X) -»■ Cf*[(X) denote the 
j th cellular chain map of X, and let Hj(X) denote the j th cellular multidimensional 
persistence module of X. 

Remark 2.7.1. It follows from the naturality of the isomorphisms between singu- 
lar and cellular homology that the singular and cellular multidimensional persistent 
homology modules of a cellular n-filtration are isomorphic. 

2.7.4 Functors from Geometric Categories to Categories of 
n-filtrations 

In applications of persistent homology one typically has some geometrical category 
of interest and a functor F from that category to n-filt for some n G N; one then 
studies and works with the functor HioF. These composite functors are also generally 
referred to as i th persistent homology functors. There thus are a number of different 
such i th persistent homology functors with different sources, each determined by a 
different choice of the functor F. 

We next define several functors F : C — > n-filt, where C is some geometric cat- 
egory. We introduced special cases of some of the definitions presented here in Sec- 
tion 11.5.11 We'll use these functors to formulate and prove our inference results in 
Chapter 4. 

In Section 12.81 we will prove stability results for Hi o F for each of functors F 
introduced here. 

In the following examples, we omit the specification of the action of these functors 
on morphisms. This should be clear from context. 

2 We are using the word "geometric" here in an informal (and rather broad) sense. 
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Recall from Section [2.1.11 that for a = (ai, a n ) G M n and b G R, we let (a, 6) = 
(a!,...,a n ,6) GK n+1 . 

Example 2.7.2. Sublevelset Filtrations 

Let C s be the category defined as follows: 

1. Objects of C s are pairs (X, /), where X is a topological space and / : X — > M n 
is a function. 

2. If (XJ),(X'J f ) G obj(C 5 ), then we define hom c s((X, /), (X', /')) to be the 
set of functions 7 : X — > X' such that /(a;) > f'(^{x)) for all a; G X. 

For X a topological space, let denote the subcategory of C s consisting of pairs 
of the form (X, f). 

Define the functor F s : C s — > n-filt on objects by taking 

F s (X,f) a = f a 

for all a6l" (and taking the map <i>F s (xj){ a -, b) to be the inclusion for all a < b G 
W 1 .) We refer to F s as the sublevelset filtration functor, to F S (X, f) as the 
sublevelset filtration of /, and to F S (X, —f) as the superlevelset filtration of 

/• 

Example 2.7.3. Sublevelset-Offset Filtrations 

We define two variants of the sublevelset-offset filtration functor, the closed and 
open variants. It is the closed variant that is of greater interest to us and that we use 
in the statement of our theorems; we define the open variant purely as a technical 
tool for use in the proofs of the results of Chapter 4. 

Let C so be the category defined as follows: 

1. Objects of C so are quadruples (X, Y, d, f), where (Y, d) is a metric space, X C 
Y, and / : X — )■ W 1 is a function. 

2. If (X, Y, d, f), (X', Y', df, f) G obj(C so ), then we define hom c so((X, F, d, /), (X', Y, d', f )) 
to be the set of functions 7 : Y — > Y' such that 7(X) C X', f[x) > /'(7( x )) f° r 

all x G X, and d(y 1 ,y 2 ) > d'(j(y 1 ),j(y 2 )) for all y u y 2 G Y. 
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For (X, X, d, f) G C so , we will sometimes write (X, d, f) as shorthand for (X, X, d, f). 
For Y a fixed topological space, let C^d) denote the subcategory of C so consisting 
of quadruples of the form (•, Y, d, •). 

Define the functor F so : C so — > (n + l)-filt on objects by taking 

F 5 °(X, Y, d, f) {a>b) = {ye Y\d(y, f a ) < b} 

for all (a, b) G W 1 x R. We refer to F 50 as the (closed) sublevelset-offset filtra- 
tion functor, to F so (X,Y, d, f) as the sublevelset-offset filtration of /, and to 
F so (X, Y, d, —f) as the superlevelset-offset filtration of /. 

Similarly, define the functor F so ~° p : C so — > (n + l)-filt on objects by taking 

F s °-°v(X, Y, d, f) {a , b) = {ye Y\d(y, f a ) < b} 

for all (a, b) ePx M. We refer to F so ~° p as the open sublevelset-offset filtration 
functor. 

Example 2.7.4. Sublevelset-Cech Filtrations 

As we did for sublevelset-offset filtrations, we define closed and open variants of 
the sublevelset-Cech filtration functor. 

Let C SCe denote the full subcategory of F so whose objects are the quadruples 
(X, Y, d, f) with X finite; let C^d) denote the full subcategory of C SCe consisting of 
quadruples of the form (•, Y, d, •). 

Recall from Section ["1.5.11 that given a metric space {Y,d), a subset X C Y, and 
b G M, we let Cech (X, Y, d, b), the (closed) Cech complex of (X,d) with parameter 
b, be the abstract simplicial complex with vertex set X such that for / > 2 and 
Xi, x 2 , -.,xi G X, Cech (X, Y, d, b) contains the (/ — l)-simplex [xi, xi] iff there is a 
point y G Y such that d(y, x^) < b for 1 < i < I. 

Let Cech °(X, Y, <i, 6), the open Cech complex of (X, <i) with parameter 6, be 
defined the same way as the closed Cech complex, except that we specify that 
Cech °(X, Y, d, b) contains the (/ — l)-simplex [xi, xj] iff there is a point y G Y 
such that d(y, Xi) < b for 1 < i < I. 
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Define the functor F SCe : C SCe — > (n + l)-filt on objects by taking 

F sde (X, Y, d, f) {a>b) = Cech (f a , Y, d, b) 

for all (a, b) e W 1 x R. We refer to F 5Ce as the (closed) sublevelset-ofFset filtra- 
tion functor, to F SCe (X, Y, d, f) as the sublevelset-offset filtration of /, and to 
F SCe (X, Y, d, —f) as the superlevelset-offset filtration of /. 

Similarly, define the functor F SCe ~° p : C SCe — > (n + l)-filt on objects by taking 

F SCe -°v(X, Y, d, /) (a , 6) = Cech °(/a, Y, d, b) 

for all (a, b) E M n x]R. We refer to F SCe ~° p as the open sublevelset-Cech filtration 
functor. 

Example 2.7.5. Sublevelset-Rips Filtrations 

Let C SR be the full subcategory of C so whose objects are quadruples (X, X, d, f), 
where X is a finite metric space. 

Recall from Section 11.5.11 that given a finite metric space (X, d) and & £ 1, we let 
Rips(X, d, b), the Rips complex of (X,d) with parameter b, be the maximal abstract 
simplicial complex with vertex set X such that for 21,22 £ X, the 1-skeleton of 
R(X,d,b) contains the edge [21,22] iff d(xi,X2) < 2b. 

Define the functor F SR : C SR — > (n + l)-filt on objects by taking 

F SR (X,d,f) (a , b) = Rips(f a ,d,b) 

for all (a, b) eR n x R. 

We refer to F SR as the sublevelset-Rips filtration functor, to F SR (X, d, f) as 
the sublevelset-Rips filtration of /, and to F S (X, d, —f) as the superlevelset- 
Rips filtration of /. 

The functor F SR has previously been considered (in the case n — 1) in 0. 



CHAPTER 2. INTERLEAVINGS ON MULTI-D PERSISTENCE MODULES 72 



2.7.5 Metrics on Geometric Categories 

We now define metrics on the isomorphism classes of objects of the geometric cate- 
gories we defined in the last section. We'll use these to formulate stability results for 
dj and to formulate various notion of optimality for dj. 

A Metric on obj(C x ) 

Let X be a topological space. We define a metric d s x on obj(C^) by taking 

4((X, A), (X, / 2 )) = sup ||A0r) - f 2 (x) \U 

xex 

A Metric on obj(C$? d) ) 

Let (Y, d') be a metric space. We now define a metric d^y^ on obj(C^? d ,^) by taking 

df° d , ) ((X 1 ,Y,d'J 1 ),(X 2 ,Y,d'J 2 )) = 

max{ sup inf max(d'(x 1 ,x 2 ),\\fi(x 1 ) - f 2 (x2)\\oo), 
xieXx X2ex 2 

sup inf max(d'(x 1 ,x 2 ), \\fi{xi) - / 2 (z2)||oo) } 

This distance is a sort of "function aware" variant of the Hausdorff distance. Since 
C^Yd) i s a subcategory of C?y d y our metric on obj(C^? d ^) restricts to a metric on 
obj(^). 

A Metric on ohj*(C SR ) 

Generalizing in a mild way a definition of [9], we define a metric d SR on obj*(C SR ). 
For f x , f Y = 0, d SR ((X, dx, fx), (X, dx, fy)) will be equal to the Gromov-Hausdorff 
metric [Sj. (In fact, the definition extends to the subcategory of C so whose objects 
are the triplets (X, dx, fx) with X compact, but we won't need the extra generality 
here.) 

To define d SR , we need some preliminary definitions and notation. Define a cor- 
respondence between two sets X and Y to be a subset C E X x Y such that Vx e X, 
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3y G Y such (x,y) G C, and Wy G Y, 3x G X s.t. G C. Let C(X, F) denote 

the set of correspondences between X and Y . 

For (X, d x , fx), (Y, d Y , f Y ) e C SR , define Y X)Y ■ X x Y x X x Y -> M> by 

?x,Y{x,y,rf = \d x (x,x') - d Y (y,y')\. 

For C G C(X, F), define r c as sup (:c>2/)>(a ., j2/ , )eC T Xi y (x, y, ar', and define \f x - fy\c 
to be sxvpr x y \ eC \\fx(x) — /y(y)||oo- Informally, Tq is the maximum distortion of the 
metrics under the correspondence C, and \fx — fy\c is the maximum distortion of 
the functions under C. 

Now define we define d by taking 

d SR ((X, d x , fx), (Y, d Y , f Y )) = inf max(lr c , \f x - f Y \ c ). 

C&C(X,Y) 2 

2.7.6 Stability Results for Ordinary Persistence 

There are two main geometric stability results for ordinary persistence in the litera- 
ture. (Though see also the generalization [5]) Each is a consequence of the algebraic 
stability of persistence [8]. 

Theorem 2.7.6 (1-D Stability Result for C x [S]). For any i G Z> , topological space 
X, and functions f\,f'2'-X^-M. such that Hi o F S (X, f\) and Hi o F S (X, f'2) are 
tame, 

d B (H t o F S (X, h), Hi o F S (X, f 2 )) < 4((X, /0, (X, f 2 )). 
For a 2-D filtration F, let diag(F) denote the 1-D filtration for which diag(F) a = 

F(a,a) ■ 

Theorem 2.7.7 (1-D Stability Result for C SR [9]). For finite metric spaces (X, d x ), (Y, d Y ) 
and functions fx '■ X — > K ; f Y : Y — > R ; 

d B (^odiag(F 5R (X, d x , fx)), ^odiag(F 5R (F, d Y , f Y ))) < d SR ((X, d x , fx), (Y, d Y , fv))- 



We'll see in Section 12.81 that both of these results admit generalizations to the 
setting of multidimensional persistence in terms of the interleaving metric. 
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2.8 Stability Properties of the Interleaving Dis- 
tance 

In this section, we observe that multidimensional persistent homology is stable with 
respect to the interleaving distance in four senses senses analogous to those in which 
ordinary persistent homology is known to be stable. As noted in the introduction, 
there is not much mathematical work to do here. Nevertheless, these observations 
are significant (if not particularly surprising) not only because they show that the 
interleaving distance is in several respects a well behaved distance, but also because 
stability is closely related to the optimality of distances on persistence modules as we 
define it in Section I2.9( insofar as we wish to understand the optimality properties of 
the interleaving distance, the stability properties of the interleaving distance are of 
interest. 

2.8.1 Stability of Sublevelset Multidimensional Persistent Ho- 
mology 

Theorem 2.8.1. For any topological space X and pairs (X, /i), (X, f 2 ) G obj(C^) 
we have, for any i G Z> 0; 

di{Hi o F S (X, /i), Hi o F S (X, f 2 )) < 4((X, A), (X, / 2 )). 

The case n = 1 is Theorem 12. 7.61 

Proof. Let d s x ((X } A), (X, f 2 )) = e. Then for any u e R n , F S (X, h) u C F S (X, f 2 ) u+e 
and F S (X, f 2 ) u C F S (X, fi) u +e- The images of these inclusions under the i th singular 
homology functor define e-interleaving morphisms between Hi o F S (X, fi) and Hi o 
F S (X, f 2 ). Thus d^Hi o F S (X, / x ), H { o F S (X, f 2 )) < e as needed. □ 



CHAPTER 2. INTERLEAVINGS ON MULTI-D PERSISTENCE MODULES 75 



2.8.2 Stability of Sub levelset- Offset Multidimensional Per- 
sistent Homology 

Theorem 2.8.2. For any metric space (Y,d') and (Xi, Y, d', fi), (X 2 , Y, d', f 2 ) G 
oh](C^Yd>)) we have, for any i G Z> , 

di{H i0 F S0 {X x , Y, d', A), ^oF so (X 2 , Y, d', f 2 )) < df° dl) {{X u Y, d', ft), (X 2 , Y, d', f 2 )). 
Proof. This is similar to proof of the previous result. Let 

df° d , } ((X u Y, d', A), (X 2 , Y, d', f 2 )) = e. 
Then for any u G M n , r6l, and e' > e, 

F so (X l7 Y, d', f x ) M C F so (X 2 , Y, d', / 2 )( tt+e /, P+e /) and 



(«,r) -r ^2, -T , « , J2j(w+e',r+e') 

F 5, °(X 2 , Y, cf, f 2 )( u ,r) C F (Xl, F, <i', /i)( u+e / )r . +e ')- 



The images of these inclusions under the i singular homology functor define e'- 
interleaving morphisms between Hi o F (Xi, Y, d', f\) and Hi o F so (X 2 ,Y,d',f 2 ). 
The result follows. □ 



2.8.3 Stability of Sublevelset-Cech Multidimensional Persis- 
tent Homology 

Theorem 2.8.3. For any (Xi, Y, d', f 1 ),(X 2 , Y, d, f 2 ) G obj(C ( 5 y c d e /} ) and i G Z> 0; 

dj^oF^iX,, Y, d', A), ^oF 5(5e (X 2 , F, d', f 2 )) < df^dX,, Y, d', fx), (X 2 , y, d', / 3 )). 

Proof. First note that the definition of _F 5Ce extends to quadruples (X, Y, cf, /) where 
X is a multiset f| whose points each have finite multiplicity and whose underlying set X 
is a finite subset of Y. Further note that for any (a, b) G M n x R, F 5<5e (X, y, <f , f)( a ,b) 



3 Informally, a multiset is a set whose elements are allowed to have multiplicity greater than or 
equal to one. 
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is a deformation retract of F (X, Y, d', f)( a ,b), and the deformation retracts can be 
taken to commute with the inclusion maps in these nitrations. Thus 

Ht o F s0e (X, Y, df, f) (a , b) = ^ o F sCe (X, Y, d', 

Let d?Yd>)((Xi, Y, d', f\), (X 2 , Y, d', f 2 ) = e. Then we can choose multisets X 1 and 
X 2 whose underlying sets are X\ and X 2 respectively, such that for all e' > e there is 
a bijection A : X\ — > X 2 such that 

1. for all x G X 1; d'(x, X(x)) < e; and ||/i(x) — /a(A(a;))|| 00 < e' and 

2. for all x G X 2 , d'(x, A" 1 (a:)) < e' and - f2(X~ 1 (x))\\ 00 < e'. 

This implies that HioF s6e (X h Y, d' , fx) and H t oF sCe (X 2 , Y, d', f 2 )) are e'-interleaved. 
Thus Hi o F sde (X u Y, d', /i) and H { o F sde (X 2 , Y, d', / 2 )) are also e'-interleaved, and 
the result follows. □ 

2.8.4 Stability of Sublevelset-Rips Multidimensional Persis- 
tent Homology 

Theorem 2.8.4. For (Xx,dx,fx), (X 2 ,d 2 ,f 2 ) G obj(C Si? ) we have, for any i G Z>o, 

d^Hi o F SR (X u d u fx), ^ o F SR (X 2 , d 2 , f 2 )) < d SR {{X x , d u fx), (X 2 , d 2 , f 2 )). 

The proof of this is a very minor modification of the argument given in [S] to prove 
Theorem 12.7.71 

Note that Theorem E3H implies TheoremEZU When n = 1, if HioF SR (Xx, d u fx) 
and HioF SR (X 2 , d 2 , f 2 ) are e-interleaved, for e G M> , then odiag(F 5fi (X l5 d 1; /i)) 
and ifj o diag(F 5fi (X 2 , <i 2 , A)) are e-interleaved. 

2.9 Optimal Pseudometrics 

In this section we introduce a relative notion of optimality of pseudometrics on persis- 
tence modules and their discrete invariants. This relative notion of optimality is quite 
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general and specializes to a number of different notions of optimality of psuedometrics 
of interest in the context of multidimensional persistence. 

We also present some first theoretical observations about optimal pseudometrics. 
We'll exploit these in Section \2. 101 to prove our optimality result for the interleaving 
distance. 

2.9.1 A General Definition of Optimal Pseudometrics 

Let Y be a set. We define a relative structure on Y to be a triple 1Z = (T, Xq-, fj-), 
where T is a set, X T = {(X s , d s )} seT is a collection of pseudometric spaces indexed by 
T, and fr = {f s '■ X s — > Y} se j- is a collection of functions. Let im(ff) = U se ri m (/s)- 

If Y is a set and 1Z = (T, X-r, fj-) is a relative structure on Y , we say a semi- 
pseudometric d on Y is 7^-stable if for every s G T and xi,X2 £ X s we have 
d(f s (xi), f s (x 2 )) < d s (x 1 ,x 2 ). 

We say a pseudometric d on Y is 7?.-optimal if d is 7?.-stable, and for every other 
7^-stable pseudometric d! on Y, we have <f (2/1,2/2) < ^(2/1? 2/2) for all 2/1,2/2 £ i m (/r). 

The following lemma is immediate, but important to understanding our definition 
of 7£-optimality. 

Lemma 2.9.1. An TZ-stable pseudometric d is H- optimal iff for any other TZ-stable 
pseudometric d! , s6T, and X\,x 2 G X s , 

\d s (x 1 ,x 2 ) - d(f s (x 1 ),f s (x 2 ))\ < \d s {x u x 2 ) - d'{f s {x 1 ),f s {x 2 ))\. 

Note that if an 7£-optimal pseudometric donY exists, its restriction to im(/-7-) x 
im(/7-) is unique. 

2.9.2 Examples 

Here we give several examples of sets Y and relative structures (7~, Xf, fr) on Y for 
which it would be interesting or useful from the standpoint of the theory and applica- 
tion of multidimensional persistent homology to identify an 7?.-optimal pseudometric. 
In Section 12.101 we will focus exclusively on the relative structures of Examples 12.9.21 
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and I2.9.4j we leave it to future work to investigate in detail the optimality of pseu- 
dometrics with respect to the relative structures of Examples I2.9.5H2.9U1 
For examples let Y = obj*(S n -mod). 

Example 2.9.2. Let TZ% = (T,Xj-, /t)j where T is the set of pairs {(T,i)\T is a 
topological space, i G Z> }, X(r,i) = obj (C|i) for (T, z) G T, d(r,i) = ^f"; an d is 
given by f (T ,i){T,g) = H { o F 5 (T, 3). 

Remark 2.9.3. By Lemma [2.9. 1[ an T^i-stable pseudometric d is 7£i-optimal iff for 
any other 7?i-stable pseudometric d', any (T,i) G T, and any (T,gi), (T,g 2 ) G X(r,i), 

\d s T ((T, 9l ), (T, ^)) - o F S (T, 9l ), ^ o F 5 (T, g 2 ))\ 
<\d s T ((T, <7i), (T, <&)) - <f (tf, o F 5 (T, <fc), Hi o F 5 (T, 92 ))|. 

This says that an 7?4-optimal psuedometric is one for which the distance between 
any two functions defined on the same topological space is preserved under the mul- 
tidimensional persistent homology functor as faithfully as is possible for any choice 
of T^i-stable psuedometric on obj*(5 n -mod). 

For the relative structures 1Z in the rest of the examples below, 7?.-optimality has 
an interpretation by way of Lemma 12.9.11 analogous to that of Remark 12.9.31 

Example 2.9.4. For i G Z> , let 7Zi ti = (T, Xj-, fr), where T is the set of topological 
spaces, X T = obj(C^) for T G T, d T = d§,, and is given by fo(T, g) = HioF s (T, g). 

Since the definitions of 1Z\ and 1Z\^ are similar, one might expect that there's a 
relationship between 7?-i-optimality and 7?.i >; j-optimality. Corollary 12.10.21 establishes 
such a relationship in the cases k = Q and k = Z/pZ for some prime p. 

Our reason for considering both TZi and IZij, rather than only IZij, is primarily 
technical: In short, the reason is that Section [2.101 give a proof of the optimality of 
di relative to only for i G N but not for i = 0; by considering optimality of (ir- 
relative to TZi we are able to obtain an optimality result which doesn't depend on 
i, and thus has a certain aesthetic appeal. I suspect that dj is also T^o optimal; 
once this is shown to be the case, there will, in my view, be little reason to consider 
T^i-optimality in formulating the results of this thesis. 
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Example 2.9.5. Let T be the set of triples {(T, d', i)\T is a topological space, d! is a 
metric on T, i G Z> }, X (TjdV) = obj(C^? d/) ) for (T,d',z) G T, rf(r,d',i) = ^f^, an d 
f(r,d',i) be given by / (T ,^)(?7, T, d', (/) = ^ o F so (f/, T, <f, g). 

Example 2.9.6. Let T be the singleton set {s}. Let X s = obj*(C 5jR ), d s = d SR , and 
/. be given by f s (T, d', g) = H l o F SR (T, d', g). 

Example 2.9.7. Let T be as in example 12.9.51 Define Xj- by taking X<t^^\ = 
obj*(C SCe ) and d (T ,#A = df° d , y Let f (T ,d>,i)( U , T > d '> 9)= H i° F SCe (U, T, d', g). 

Example 2.9.8. We can present variants of examples 12.9.51 T2.9.61 and 12.9.71 where 
we only consider homology in a single dimension, in the same way we did for Exam- 
ple I2TO in Example I21T1 

Example 2.9.9. Let W : obj*(-B n -mod) ->7bea discrete invariant [7] with values 
in a set Y, let (T, Xf, fj-) be any relative structure on obj*(-B n -mod), and let fr = 
W o f' r . Then (T, Xf, fr) is a relative structure on W. 

For example, we can take W to be the rank invariant [7] and (T, Xf, fj-) t° be 
the relative structure of Example 12.9.21 

2.9.3 Induced Semi-Pseudometrics and a Condition for the 
Existence of Optimal Pseudometrics 

We'll see here that a relative structure TZ = (T, Xf, fr) on a set Y induces a semi- 
pseudometric dn on im(/-7-) with a nice property. 
For y u y 2 e im(/ r ), let 

-4(2/1, 2/2) = {(s,xi,x 2 )|s eT,x 1 e X s ,x 2 G X a ,f a {xi) = yi,f s {x 2 ) = y 2 }. 

Now define dn(yi,y 2 ) = mi( SjXljX2 ) €A ( yi ^ 2 ) d s (xi,x 2 ). dn is an Testable semi- 
pseudometric. In general it needn't satisfy the triangle inequality. However, if T 
is a singleton set, as for instance in Example I2.9.6[ then dn does satisfy the triangle 
inequality and is a pseudometric. 
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Lemma 2.9.10. for any H- stable pseudometric d on im(fj-), d < dfc. In particular, 
if d-R is a pseudometric, it is 1Z- optimal. 

Proof. Let d be an Testable pseudometric on im(/-7-). Since d is Testable, d(yi,y 2 ) < 
d s {x 1 ,x 2 ) for all (s,x 1 ,x 2 ) E A(y 1 ,y 2 ). Thus d(y u y 2 ) < m{ {s ^ X2)&A{y ^ y2) d s (x 1 ,x 2 ) = 

It's easy to see that a pseudometric d on im(/7-) can be extended (non-uniquely) to 
a metric on Y; if d is 7£-optimal then, by definition, any extension to a pseudometric 
on Y is as well. Thus by the Lemma r2.9.10[ if d-ji is a pseudometric, then an 7?.-optimal 
pseudometric exists on Y; its restriction to im(/7-) is dfc. It follows, for example, that 
for 1Z as in Example 12.9.61 an 7?.-optimal metric exists. 

In Section 12.101 we'll show that for Y = obj*(5 n -mod), i e N, and k = Q or 
k = Z/pZ for some prime p, the restriction of dj to the domain of d^. is equal to 
d-Tz 1:i , so that di is T^i^-optimal. It will follow easily that dj is also 7?.i-optimal. 



2.10 Optimality of the Interleaving Distance (Rel- 



Summary of Results 

The central result of this section is the following theorem: 

Theorem 2.10.1. For k = Q or Z/pZ for some prime p, and % 6 N, dj is 1Z\^- 
optimal. 

This theorem also yields the following weaker optimality result, which has aes- 
thetic advantage of not depending in its formulation on a choice of homology dimen- 
sion. 

Corollary 2.10.2. For k = Q or Z/pZ for some prime p, dj is 1Z\- optimal. 



ative to Sublevelset 




We also present analogues of Theorems 12.10.11 and Corollary 12.10.21 for well be- 
haved -E?i-persistence modules which hold for arbitrary fields k; these are given as 
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Theorem 12.10.81 Our analogue of Theorem 12.10.11 for well behaved -Bi-persistence 
modules holds for i e Z> . 

Remark 2.10.3. I suspect that theorem 12.10.11 holds, more generally, for arbitrary 
fields and for % e Z> , though we do not prove that here. To orient the reader, it may 
be helpful to point out that such a generalization would subsume both Corollay l2.10.2l 
and Theorem 12.10.81 

2.10.1 Optimality of Interleaving Distance and Geometric 
Lifts of Interleavings 

Proof of Theorem \2.10.1\ Fix k = Q or k = Z/pZ for some prime p. For % e Z>o, 
Lemma [2.9. 101 implies that to prove that dj is T^j-optimal, it's enough to show that 
the restriction of di to the domain of d^ . is equal to d-R lt . We show that for iGN, 
this follows from the following proposition, which we can think of as guaranteeing the 
existence of "geometric lifts" of interleavings to the category C s . 

Proposition 2.10.4. [Existence of Geometric Lifts of Interleavings] Let k = Q or 
k = Z/pZ for some prime p. If i G N, and M and N are e-interleaved B n -modules, 
then there exists a CW-complex X and continuous functions 7m, 7tv '■ X — > M. n such 
that Hi o F S (X, 7 m) =M, Hio F S (X, 7iV ) = N, and || 7m - 7w||oo = e. 

Sections I2.6.2tl2~10.6l are devoted to the proof of Proposition 12.10.41 

Corollary 2.10.5. For k = Q or X/pX for some prime p, for every B n -module M 
and iGN, there exists a CW-complex X and a continuous function 7 : X — > MJ 1 such 
that H % o F S (X, 7 ) = M. 

To see that Proposition 12.10.41 implies that di is equal to d^ . on their common 
domain, let M and iV be two S n -persistence modules such that di(M, N) = e. Choose 
5 > 0. Then M and iV are are (e + <5)-interleaved. By the proposition, there exists 
a topological space X and maps jm,Jn '■ X — » W 1 such that Hi o F S (X, 7m) — M, 
HioF s (X,^ N ) = N, and || 7m -7tv||oo = e + 5. Thus by stability, d n ^(M,N) < e + 5. 
Since this holds for all 5 > 0, d Ul . (M, N) < e. By LemmaEHini d Kl i (M, N) — e. □ 
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Note that the extension of Theorem 12.10.11 to the case i = would follow by the 
same argument from the following conjectural extension of Proposition I2.10."4l to the 

case 2 = 0. 

Conjecture 2.10.6. Let k = Q or k = Z/pZ for some prime p. Let M and N be 

e-interleaved B n -modules such that for some topological spaces Xm, Xn and functions 
f M : X M -> W 1 , f N :X N ^ R n , H o F S (X M , f M ) = M and H o F S (X N , f N ) = N. 
Then there exists a CW-complex X and continuous functions JM)Jn '■ X — » M n such 
that H o F S (X, lM ) = M, H o F s (X, 7jv ) = and || 7m - 7 iv||oo = c 

We do not prove this conjecture here. 

Proof of CorollaryWJM Fix any % G N. Write Tl x = (S, X s , fs), Ki,i = (S', X s >, f s .). 
By Theorem 12.8.11 dj is 7?.i-stable. Further, any 7^i-stable pseudometric d' on 
obj*(-B n -mod) is 7^-i^-stable. By Corollary I2.10.5[ when k = Q or Z/pZ for some 
prime p, xm(fs>) = im(fs) = obj*(5 n -mod). Thus, if d! is any l^x stable metric, 
d'(M,N) < di(M,N) for any M, N G im(f s ) by the fti, r optimality of dj. Thus di 
is 7?.i-optimal. □ 

2.10.2 Optimality of the Bottleneck Distance 

Before proceeding with the proof of Proposition 12.10.41 we present optimality results 
for the bottleneck distance. 

Let Y denote the set of isomorphism classes of tame -Bi-persistence modules. 
Since the bottleneck distance ds is only defined between elements of Y, formulating 
statements about the optimality of ds requires that we consider a relative structure 
on Y rather than on obj*(i?i-mod). We can (in the obvious way) define restrictions 
IZ2 and IZ2A of the relative structures 1Zi and IZij to relative structures on Y. 

Then, given Proposition 12. 10. "4^ the proofs of Theorem 12. 10. II and Corollary l2.10.2l 
adapt to give that for k = Q or k = Z/pZ for p a prime, ds is ^-optimal, and for 
any i G N, ds is also 7^ 2ii -optimal. 

Moreover, if we further restrict our attention to well behaved Si-persistence mod- 
ules then we can prove an optimality result for arbitrary fields k and persistent ho- 
mology of any dimension. 
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The key is the following variant of Proposition 12.10.41 

Proposition 2.10.7. Let k be any field and let i G Z>o. Let M and N be well 
behaved B\-persistence modules with ds{M,N) = e. If i = 0, assume further that 
for some topological spaces Xm, Xn and functions fu '■ Xm — > K. ; /n '■ Xn — > M., 
H o F S (X M , fu) = M and H o F S (X N , f N ) = N. Then for any 5 > 0, there exists a 
CW-complex X and continuous functions jm, In '■ X — > R such that HioF s (X, j M ) = 
M, Hi o F s {X,-f N ) = N, and || 7m - 7tv||oc <e + 5. 

Proof. An easy constructive proof, similar on a high level to our proof of Proposi- 
tion |2TTU31 below but much simpler, follows from the definition of ds and the structure 
theorem for well behaved persistence modules (Theorem I2.3.6p . We leave the details 
to the reader. □ 

Now let Y' denote the set of isomorphism classes of well behaved -Bi-persistence 
modules. Let 'Kj, and 7^3^ denote the restrictions of the relative structures and 
7^1,1 to relative structures on Y'. 

Our optimality result for the bottleneck distance on well behaved persistence 
modules is the following: 

Theorem 2.10.8. For any field k and i G Z>o, g?b is IZ^^-optimal. Further, ds is 
1Z%- optimal. 

Proof. Using Proposition 12.10.71 in place of Proposition I2.10.4[ the proofs of Theo- 
rem 12.10.11 and Corollary 12.10.21 carry over with only minor modifications to give the 
result. □ 

Remark 2.10.9. In the case of 0-D ordinary persistence, our Theorem 12.10.81 implies 
a slight weakening of the optimality result of [21]. It is easy to check that in the 
geometric context considered in [2TJ , the persistent homology modules obtained must 
satisfy property 2 in the definition of a well behaved -Bi-persistence module, but they 
needn't satisfy property 1. To strengthen our optimality result to a full generalization 
of the result of |21j . we'd want a generalization of Theorem 12 . 3 . 61 to all I?i-persistence 
modules satisfying property 2. We presume that this can be obtained via a slight 
strengthening of Theorem 12.2.21 but we do not pursue the details of this here. 
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2.10.3 Proof of Existence of Geometric Lifts of Interleavings, 
Part 1: Constructing the CW-complex 

The rest of Section 12.101 is devoted to the proof of Proposition 12.10.41 

Corollary 12.6.51 gives us n-graded sets Wi,yV 2 and homogeneous sets 3^1? 3^2 C 
(Wi,W 2 ) such that y t G (Wi, W 2 (-e)), y 2 G (Wi(-e), W 2 ), and 

M= (Wi,W 2 (-e)|yi,y 2 (-e)), 
iV= (Wi(-e),W 2 |yi(-e),y 2 >. 

Given such Wi, W 2 , 3^i, 3^2, we proceed with the proof of Proposition 12.10.41 be- 
ginning with the construction of the CW-complex X whose existence is posited by 
the proposition. 

Write W = Wi U W 2 and y = y 1 U y 2 . If M = N = 0, Proposition EM clearly 
holds, so we may assume without loss of generality that M and N are not both trivial. 
Under this assumption, W^l. 

For an n-graded set S, let GCD(S) = (vx,...,v n ), where Vi = mi s€S gr(s)i. In 
general, some of the components of GCD(W) may be equal to — oo. To simplify our 
exposition, we first present the remainder of the proof of Proposition 12.10.41 in the 
special case that GCD(W) G W 1 . The adaptation of our proof to the general case is 
reasonably straightforward; we outline this adaptation in Section 12.10.61 

We'll define X so that 

1. X has a single 0-cell B. 

2. X has an i-cell e l w for each w G W. 

3. X has an (i + l)-cell e^ +1 for each y G y. 

For such X, the attaching map for each z-cell e l w must be the constant map to B. 
To define X, then, we need only to specify the attaching map a y : S" 1 — >■ X* for each 

We do this for k = Q, and leave to the reader the easy adaptation of the con- 
struction (and its use in the remainder of the proof of Proposition 12.10.41) to the case 
k = Z/pZ. 
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For any y G y, we may choose a finite set W y C W such that gr{w) < gr{y) for 
each w G W y , and 



for some a' wy G Q. There's an integer z such that for each a' wy in the sum, za' wy G Z. 
Let For w G" W y , define a wy = 0. 

Lemma 2.10.10. There exists a choice of attaching map a y : S % — >■ X* /or eac/i y E y 
such that the CW- complex X constructed via these attaching maps has o~j^-i{e y +1 ) = 
J2 w ew a ^y e i f or allyey. 

Proof. Let p % be as defined in Section 12.7.11 For each w G W, p l and the charac- 
teristic map Q w induce an identification of im($ w ) with an z-sphere S^. We have 
that (X\ B) = A^ e >v(5'i,) B). Choose a basepoint o G S l and for each w G W y , 
let cr^y : (S\o) —> (S l w ,B) be a based map of degree a wy . [c wy ] G iii(X\B), where 
7Tj(X J , S) denotes the i th homotopy group of X 1 with basepoint B. 

Order the elements of W y arbitrarily and call them w\, ...wi. Let a y : {S l ,o) — > 
(X\ B) be a map in [<J wiy \ ■ [o~ W2y \ ■ ... • [o~ Wiy \ G VTj(X*, B). Then for any w G W, q w oa y 
is a map of deg (See Section [2.7.11 for the definition of q w ). By the definition 

of Sii l given in Section 12. T. H the lemma now follows. □ 

We choose the attaching maps a y so that 5f +l has the property specified in 
Lemma 12.10.101 

2.10.4 Proof of Existence of Geometric Lifts of Interleavings, 
Part 2: Defining jm an d 

Having defined the CW-complex X, we next define 7A/,7iv : X — > M n . 



X is the quotient of X under the equivalence relation generated by the attaching 
maps of the cells of X. Let 7r : X — > X denote the quotient map. For a topological 
space A, let £(A,M. n ), denote the space of continuous functions from A to M. n . The 




(2.3) 



Let X = {B} U wew Dl U yey D y 
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map ~ : €(X, R n ) — > <£(X,R n ) defined by f(x) = f(ir(x)) is a bijective correspon- 
dence between elements of £(X, R n ) and elements of £(X,M n ) which are constant on 
equivalence classes. 

In what follows, we'll define 7m and 7^ by specifying their lifts 7m,7jv- 

We'll take each of our functions 7m, In to have the property that for each disk of 

X, the restriction of the function to any radial line segment (i.e. a line segment from 
the origin of the disk to the boundary of the disk) is linear. Given this assumption, 
to specify each function it is enough to specify its values on the origins of each disk 
of X. We now do this. 

For any i G N and any unit disk D in M 4 , let 0(D) denote the origin of D. 

• ^ M (B)=^f N (B) = GCD(W); 

• For x G Wi U y u 7m(0(D x )) = gr(x); 

• For x G W2 U y 2 , 7m(0(D x )) = gr(x(-e)). 

• For x G Wi U y u >y N (0(D x )) = gr(x(-e))- 

• For x G W2 U y 2 , 7 N (0(D X )) = gr(x). 

Lemma 2.10.11. ||7m — 7at||oo = e. 

Proof. Assume that for a disk D of X, |7m(o) — 7at(o)| < e for all a G 5-D, and that 
|7m(0(-D)) -Zf N (0(D))\ = e. We'll show that then \^ M (a) -7jv(a)| < e for all x £ D. 
Applying this result once gives that the lemma holds on the restriction of 7m, 7jv to 

X I . Applying the result a second time gives that the lemma holds on all of X. 

To show that |7m(g0 — 7jv( a )l < e , let x be a point in D and write a = tO(D) + 
(1 — £)6 for some b G and < t < 1. Since the restrictions of 7m and 7^ to any 
radial line segment from 0(D) to 5D are linear, we have that 7m (a) = £7m(0(-D)) + 
(1 - t)7Af(&), and 7JV (a) = t^ N (0(D)) + (1 - t) 7JV (&). Thus | 7M (a) - %(a)\ < 
t\^ M (0(D)) - nf N (0(D))\ + (1 - t)\j M (b) - 7^(6)1 < te + (1 - t)e = e as needed. □ 
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2.10.5 Proof of Existence of Geometric Lifts of Interleav- 
ings, Part 3: Showing that H t o F s (X,-f M ) = M, Hi o 

F s (X, lN )^N 

Now it remains to show that H t o F S (X, j M ) = M, H t o F S (X, j N ) = N. We'll show 
that HioF s (X^ M ) — M; the argument that HioF s (X,^ N ) = N is essentially same. 

For a E M. n , let T a denote the subcomplex of X consisting of only those cells e such 
that 7jv/(0(Z)(e))) < a, where in this expression D(e) is the disk of X whose interior 
maps to e under tt. {J r a }aem. n defines a cellular filtration, which we'll denote T . Let 
X a = F S (X, 7Ar) a - It's easy to see that T a is a deformation retract of X a . Further, 
the inclusions of each JF a X a define a morphism x of filtrations; this morphism 
of filtrations maps under Hi to a morphism : H^J 7 ) — > Hi(F s (X,j M )) of 

-B n -persistence modules whose maps Hi(x)a '■ Hi(J- a ) —> Hi(X a ) are isomorphisms. 
Any homomorphism of I? n -persistence modules whose action on each homogeneous 
summand is a vector space isomorphism must be an isomorphism of -B n -persistence 
modules, so H^x) is an isomorphism. Thus, to prove that Hi o F S (X, 7a/) — M, it's 
enough to show that Hi{JF) = M. 

By Remark EZH H^F) = Hf w (F). 

Note that J 7 has the property that each cell e of X has a unique minimal grade of 
appearance grjr(e) in J 7 . Since each cell has a unique minimal grade of appearance, 
for any j E Z> , C™ '(J 7 ) is free: 

Cf W {7) = ©e.cxa.-cell B„(-<77>(e>')). 

The usual identification of j'-cells of X with a basis for Cf (X) extends in the obvious 
way to an identification of the j-cells of X with a basis for C^ w {J 7 )- 

Moreover, the boundary homomorphism 5f^_ l : Cf^[{X) — > Cf w (X) and the 
boundary homomorphism 5f +l : Cf_^[{J-) — > Cf w {T) are related in a simple way: 

Lemma 2.10.12. ^ x (e* +1 ) = E meW ^^(J)(^(et). ^(ej*- 1 ))^). 

Proof. Recall that we constructed X in such a way that for y E y, S^^e 1 ^ 1 ) = 
Yl w ew a wy e w- The result follows in a routine way from this expression for <5^i(e* +1 ) 
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and the definition of the boundary map 8f +v □ 

Now note that we have Sf = 0. If i ^ 1 this is follows from the fact that J 7 has 
no i — 1 cells. If % — 1, it is still true because of the isomorphism between cellular and 
singular persistent homology: we must have Cq W (J 7 ) = B n {—grjr(B)) = Hq^J 7 ) = 
H$ W {F), so 5t = 0. 

Therefore H? w (F) = Cf w (J r )/im(^ 1 ). 

The bijection which sends w G W to the cell e % w induces an isomorphism A : 
(WiUV^(-e)) ^Cf w {T). 

By the expression ( 12. 3 p for y in terms of a' wy given in Section I2.10.3j for y G 

yiuy 2 (-e), 

Thus 

= (iYl a 'wy¥cfw { T)(grA<)igrA4 +1 ))( e l)}y£y) 

wew 

= (iYl a ™yVc™{T){grA e w),9 r A e t l ))( e ™)}yzy) 
wew 

= im(Ci) 

by Lemma f2. 10.121 A therefore descends to an isomorphism between Cf w (F) / im.(5f +1 ) 
and (W 1 ,W 2 (-e))/(y 1 ,y 2 (-e)). This shows that Hf w (J 7 ) = M and thus completes 
the proof of Proposition 12. 10.41 in the special case that GCD(W) G W 1 . 

2.10.6 Proof of Existence of Geometric Lifts of Inter leav- 
ings, Part 4: Adapting the Proof to the Case That 

GCD(W) W 1 

As noted above, the proof of Proposition 12.10.41 we have given for the special case 
that GCD(\V) G M" adapts readily to a proof for the general case. We now outline 
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this adaptation, leaving to the reader the straightforward details. 

Let X' be the standard CW-complex structure on K. That is, we take each zGZ 
to be a 0-cell in X', and for each z G Z, we take the interval (z, z + 1) to be a 1-cell 



To carry out the proof for the general case, we first present a modified version 
of the construction of the CW-complex X given in Section 12.10.31 In this modified 
version, we construct the CW-complex X so that 

1. X' is a sub complex of X. 

2. X has an z-cell e l w for each u> G W. 

3. X has an (z + l)-cell e^ +1 for each y E~y. 

4. As a set, X = X' II {e^} weW II {e^ 1 }^. 

For r = (ri,...,r n ) G M n , let |_ r J = rnax{z G Z|z < Tj for 1 < j < n}. For all 
w G W, let the attaching map of e l w be the constant map to the 0-cell [gr(w)\ C X' . 

This defines X % . To complete the construction of X, it remains only to specify 
the attaching map a y : S l — >■ X 1 for each y G 3^- 

Let a„, y be defined as in Section 12.10.31 An analogue of Lemma 12.10.101 holds in 
our present setting and admits a similar proof. Invoking this result, for each y G y we 
choose o y : S % — > X 1 such that the CW-complex X constructed via these attaching 
maps satisfies 5f +1 (e y +1 ) = Y. w ew a ™y e w for each V e 3 ; - 

For r G X' = R, we define 7m(?") — 7Af( r ) = r l- As in the special case, we define 
the values of 7m and 7^ at the origin of the disks D l w and D l+1 as follows: 



As in the special case, we require the restriction of 7^ and 7jv to radial line 
segments of the disks D l w and D l y +1 to be linear. This completes the specification of 
7 M and 77V. 



in X'. 



. ForxG WiUtt, j M (0(D x )) 
• For x G W 2 U^ 2 , ^m(0(D x )) 



gr(x); 
gr(x(-e)). 



• For x G Wi U y*i, Jn(0(D x )) 

• For x G W2 U ^2, 7jv(0( j D ;c )) 



#r(x(-e)); 
5-r(x). 
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The argument of Lemma [2.10.111 shows that \\jm ~ In I loo — and the argument 
of Section f2. 10. 51 adapts in a straightforward way to show that Hi o F (X, jm) — M 
and Hi o F S (X, ^y N ) = N. □ 

2.11 Reducing the Evaluation of dj to Deciding 
Solvability of Quadratics 

Let M.Q{k) denote the set of multivariate systems of quadratic equations over the 
field k. 

Fix n e N and Let M and N be finitely presented _B n -persistence modules. Let 
q be the total number of generators and relations in a minimal presentation for M 
and in a minimal presentation for N. We show in this section that given minimal 
presentations for M and N, for any e > deciding whether M and N are e-interleaved 
is equivalent to deciding the solvability of an instance of M.Q{k) with 0(q 2 ) unknowns 
and 0(q 2 ) equations. 

We also show that dj must be equal to one of the elements of an order 0(q 2 ) 
subset of R>o defined in terms of the grades of generators and relations of M and N. 
Thus, by searching through these values, we can compute dj by deciding whether M 
and iV are e-interleaved for O(logg) values of e. That is, we can compute dj(M,N) 
by deciding the solvability of O(logg) instances of M.Q{k). 

If e.g. k is a field of prime order, a standard algorithm based on Grobner bases 
determines the solvability of systems in AiQ(k). M.Q(k) is NP-complete, however, 
and this algorithm is for general instances of A4Q(k) prohibitively inefficient. We 
leave it to future work to investigate the complexity and tractability in practice of 
deciding the solvability of systems in AiQ(k) arising from our reduction. 

In practice, we are interested in computing the interleaving distance between the 
simplicial persistent homology modules of two simplicial ra-filtrations. To apply the 
reduction presented here to this problem, we need a way of computing a presenta- 
tion of the multidimensional persistent homology module of a simplicial n-filtration; 
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strictly speaking, our reduction does not require that the presentations of our mod- 
ules be minimal. However, to minimize the number and size of the quadratic systems 
we need to consider in computing the interleaving distance via this reduction, we do 
want the presentations we compute to be minimal. 

We hope to address the problem of computing a minimal presentation of the 
simplicial persistent homology module of a simplicial n-filtration in future work. 

2.11.1 Linear Algebraic Representations of Homogeneous El- 
ements and Morphisms of Free ^-persistence Mod- 
ules 

Representing Homogeneous Elements of Free £? n -persistence Modules as 
Vectors 

Given a finitely generated free _£> n -persistence module F and an (ordered) basis B = 
bi, bi for F, we can represent a homogeneous element v G F as a pair ([v, B],gr(v)) 
where [v,B] G k l is a vector: if v = Y^i: g r(v)>gr(bi) atfF{gr{bi), gr(v))(bi), with each 
cij G k, then for 1 < i < I we define 



Remark 2.11.1. Note that for 1 < i < I, [bi, B] = e*, where e^ denotes the i th 
standard basis vector in k l . 

If V C F is a homogeneous set, we define [V, B] = {[v, B]\v G V}. 

Representing Morphisms of _B n -persistence Modules as Matrices 

Given finitely generated _B n -persistence modules F and F' and (ordered) bases B = 
and B' = b[, ...,b' m for F and F' respectively, let Mat k (B, B') denote the set 
of m x I matrices A with entries in k such that A^ = whenever gr(bj) < gr^b'^. 
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We can represent a morphism / G hom(F, F') as a matrix [/, B, B'\ G Matk(B, B'), 
where if /(&,-) = J2i: gr (b j )>gr(b' i ) a i3 i PF'(gr(b' i ), gribj))^, with each a*,,- G k, then 



Oij if gr{bj) > gr(bQ 
otherwise. 



Lemma 2.11.2. JTie map [-,5,5'] : hom(F, F') ->■ Mat k (B,B') is a bijection. 
Proof. The proof is straightforward. □ 

Note also the following additional properties of these matrix representations of 
morphisms between free i? n -modules: 

Lemma 2.11.3. Let F,F',F" be free B n -persistence modules with ordered bases 
B,B',B". 

(i) ///i,/ 2 ehom(F,F') then [f\ + f 2 , B, B'} = [f\, B, B'} + [f 2 , B, B'], 

(ii) Ifh G hom(F, F'), f 2 G hom(F', F") then [f 2 of u B, B"] = [f 2 , B', B"\ \f u B, B% 
(Hi) For any e > 0, [S(F, e), B, B(e)] = I\b\, where for m e N, I m denotes the mxm 

identity matrix. 

Proof. The proof of each of these results is straightforward. □ 
For a graded set W and u G R n , let W u = {y G W\gr(y) < u}. 

Lemma 2.11.4. If F 1 ,F 2 are free B n -persistence modules with bases B\,B 2 and W\ C 
F\,W 2 C F 2 are sets of homogeneous elements then a morphism f : F\ — > F 2 maps 
(Wi) into (W 2 ) iff [/, B u B 2 ][w, B,} G span[^| r(w) , B 2 \ for every w G W\. 

Proof. This is straightforward. □ 

2.11.2 Deciding Whether Two £> n -persistence Modules are e- 
interleaved is Equivalent to Deciding the Solvability 
of a System in A4Q(k) 

Let (Gm\Rm), (Gn\Rn) be presentations for finitely presented _B n -modules M and N, 
and assume the elements of each of the sets Gm, Gn, Rm, Rn are endowed with a total 
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order, which may be chosen arbitrarily. For a finite ordered set T and 1 < % < \T\, 
let Tj denote the i th element of T. 

We now define six matrices of variables, each with some of the variables con- 
strained to be 0. 



• Let A be an \Gn\ x \Gm\ matrix of variables, with = iff gr(GM,j 
gr{G Nyi ) + e. 

• Let B be an \Gm\ x \Gn\ matrix of variables, with B^- = iff gr(GNj 
gr(G M ,i) + e. 

• Let C be an \Rn\ x \Rm\ matrix of variables, with = iff gr(R M j 
gr(R N ,i) + e. 

• Let D be an \Rm\ x \Rn\ matrix of variables, with = iff gr{R^j 
gr(R M ,i) + e- 

• Let E be an \Rm\ x \Gm\ matrix of variables, with E^- = iff gr(GM,j 
gr(R M ,i) + 2e. 

• Let F be an \Rn\ x \Gn\ matrix of variables, with F^- = iff gr(GN, 
gr(R N;i ) + 2e. 



< 



< 



< 



< 



< 



< 



Let Tm denote the \Gm\ x |-Rm| matrix whose i column is [Rm,i, Gm] and let Tj 



iV 



denote the |Gjv| x \Rn\ matrix whose i column is [Rn^G 



Theorem 2.11.5. M and N are e-interleaved iff the multivariate system of quadratic 
equations 

AT M = T N C 

BTat = T M D 
BA - 7| Gm | = T M E 
AB — I\ Gn \ = T N F 

has a solution. 



Proof. To prove the result, we proceed in three steps. First, we observe that for 
any free covers (F M ,p M ) and (F N ,p N ) of M and N, the existence of e-interleaving 
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morphisms between M and N is equivalent to the existence of a pair of morphisms 
between F M and F N having certain properties. We then note that the existence of 
such maps is equivalent to the existence of two matrices, one in MoI^Gm-iGn) an d 
the other in Matk(GN ,Gm), having certain properties. Finally, we observe that the 
existence of such matrices is equivalent to the existence of a solution to the above 
multivariate system of quadratics. 

Let (F m ,Pm) and (F N ,p N ) be free covers of M and N. 

Lemma 2.11.6. M and N are e-interleaved iff there exist morphisms f : Fm — > -Pjv(e) 
and g : F^ — >■ Fm(z) such that 

1. /(ker(pAf)) C (ker(pjv))(e) ; 

2. g(kex(p N )) C (ker( / o M ))(e) ; 

3. gof-S(F M ,2e)c{ter{p M )){2e), 
I fog-S{F N ,2e) C (ker( P7V ))(2e). 

We'll call morphisms /, g satisfying the above properties e-interleaved lifts of 
the free covers (Fm,Pm) an d (Fn,Pn)- 

Proof. Let / : M — > N(e) and g : N — > M(e) be interleaving morphisms. Then by 
Lemma B.1.31 there exist lifts / : F M — > F N (e) and g : F N — > F M (e) of / and g. By 
the definition of a lift, / and g satisfy properties 1 and 2 in the statement of the 
lemma, g o f is a lift of g o f — S(M, 2e). S(Fm, 2e) is also a lift of S(M, 2e), so by 
the uniqueness up to homotopy of lifts (Lemma 12.1.31) . / and g satisfy property 3. 
The same argument shows that / and g satisfy property 4. 

The converse direction is straightforward; we omit the details. □ 

Now let F M = (G M ), F N = (G N ), and let p M : F M ->■ F M /(R M ), p N : F N -> 
Fn/ (Rn) be the quotient maps. Since the interleaving distance between two modules 
is an isomorphism invariant of the modules, we may assume without loss of generality 
that Fm/ (Rm) = M and F^j (Rn) = N. Then (Fm, Pm) and (Fn, pn) are free covers 
of M and N. 

Lemma 2.11.7. M and N are e-interleaved iff there exist matrices A e Matk(GM,GN) 
and B e Mat^G^i Gm) such that 
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1. A[w,G M ] G spa.n[R 9 x' 6 ,Gn] for all w G Rm, 

2. B[w, G N ] G span[ii!^ )+e , G M ] for all w G Rn , 

3. (BA - /| GM |)(e 4 ) G span[< (GM ' !)+2e , G M ] for 1 < i < |G M |, 
^ (AS - J| Gjv |)(e,) G span[< r(GiV -' )+2 \ GW] /or 1 < i < \G N \. 

Proof. By Lemma T2. 11.61 M and iV are e- interleaved iff there exists e-interleaved lifts 
/ : F M — > F N and g : F N — >■ F A/ of the free covers p M ) and (F/v, pjy). 

By Lemma 12.11.41 morphisms / : — > Fjy and g : Fjy — >■ -Fa/ , are e-interleaved 
lifts iff 

1. [f,G M ,G N (e)][w,G M ] G s P( m[R N (ey r W,G N (e)] for all w G i? M , 

2. [^,G w ,GAf(e)][w,G JV ] G span[i? M (e) sw(w) ,G M (e)] for all w G Fjv, 

3. [£o/- S(F M ,2e), G M ,G M {2e)\ [w,G M ] G span[ J R A/ (2e)^(-), G M (2e)] for all w G 

4. [/o£- S(F^,2e),G w ,G w (2e)][ W ,G w ] G span [i^ (2^), G N (2e)} for all «; G 
Gn- 

By Lemma [2. 11.31 



[Sf o / - 2e), G A /, GA/(2e)] = GW, G M (e)][/, G M , G>(e)] - I\g m \ 

and 

[/off- S(Fn, 2e), GW, Gjv(2e)] = [/, G M , GW(e)][& Gjy, G M (e)] - 

Also, by Remark |2.1 1.11 for 1 < i < \Gm\, [Gm,%,Gm] = e*, where is the i th 
standard basis vector in fc' GM L Similarly, for 1 < i < \Gn\, [Gn,uGn] = e*, where ej 
is the i t/l standard basis vector in /cl Gjv L 
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Finally, note that we have that 

[R N (e)^ w \G N (e)} 
[R M {e) 9w{w \G M {e)} 
[R M (2e)^ w \G M (2e)} 
[R N (2e)^ w \G N (2e)} 

Using all of these observations, Lemma \2 . 1 1 . 71 now follows from Lemma \2. 11.21 □ 

Finally, Theorem 12.11.51 follows from Lemma 12.11.71 by way of elementary matrix 
algebra and, in particular, the basic fact that for l,m G N and vectors v,V\, ...,vi in 
k m , v G span(t>i, vj) iff there exists a vector w G k l such that v = Vw, where V is 
the m x I matrix whose i th column is Wj. □ 

Remark 2.11.8. Note that the size of the system of quadratic equations in the 
statement of Theorem 12 . 1 1 . 51 is 0(q 2 ), where q is the total number of generators and 
relations in the presentations for M and N. For any e > 0, the system of quadratics 
has as few variables and equations as possible when the presentations for M and N 
are minimal. 

2.11.3 Determining Possible Values for di(M,N) 

Let M and iV be finitely presented B n -modules, and let U l M , U l N , Um, and Un be as 
defined at the beginning of Section 12.51 Let 

U m ,n = (J ({\x - y\} xe uit,yeujt U {^\x - y\} x .yeu\ { U - y\} x , y euf/J U {0, oo}. 

Note that \Um,n\ — 0(q 2 ), where as above q is the total number of generators and 
relations in a minimal presentation for M and a minimal presentation for N. 

Proposition 2.11.9. di(M,N) G U m ,n- 

Proof. Assume that for some e' > 0, e' ^ Um,n, M and N are e'-interleaved. Let e be 
the largest element of Um,n such that e' > e, and let 5 = e' — e. 



= [R% w+e , G N ] for all w G R M 
= [R g M (w)+€ , G M \ for all weR h 
= [RZ iw)+2 \G M ]for all weG 
= « r( " )+2e , G N ] for all w E Gj 
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We'll check that M, N, e and 5 satisfy the hypotheses of Lemma \2. 5. 71 The lemma 
then implies that M and N are e-interleaved. The result follows. 

By assumption, M and N are (e + 5)-interleaved, so the first hypothesis of 
Lemma \2 .5 .71 is satisfied. We'll show that the second hypothesis is satisfied; the proof 
that the third hypothesis is satisfied is the same as that for the second hypothesis. 

If z £ Um then for no i, 1 < i < n, can an element of U l N lie in (zj + e, Zi + e+ 5]; 
if, to the contrary, for some i there were an element u £ U l N D (z + e, z + e + S\, then 
we would have |u — ^| £ Um,n, and e < \u — Zi\ < e + 8, which contradicts the way 
we chose e and 5. Thus by Lemma 12. 5.4[ ^pn{z + e, z + e + 5) is an isomorphism. 

Similarly, for no i, 1 < i < n, can an element of U l M lie in (zi + 2e, Zj + 2e + 25]; if, 
to the contrary, for some i there were an element u £ U % M fl (z + 2e, z + 2e + 25], then 
we would have \\u — Zi\ £ Um,n, and e < — Zi\ < e + 5, which again contradicts 
the way we chose e and 5. By Lemma T2.5.4[ (fM{z + 2e, z + 2e + 2<5) is an isomorphism. 

Thus the second hypothesis of Lemma 12.5.71 is satisfied by our M, N, e,5, as we 
wanted to show. □ 

2.12 Discussion of Future Work on the Interleav- 
ing Distance 

Theorem 12. 4. 2\ Corollary 12. 5. 2} and Corollary 12.10.21 show that the interleaving dis- 
tance is a natural generalization of the bottleneck distance to the setting of multidi- 
mensional persistence. 

Insofar as the interleaving distance is in fact a good choice of distance on multidi- 
mensional persistence modules, the question of how to compute it is interesting and, 
it seems to us, potentially important from the standpoint of applications. The results 
of Section 12.111 suggest a path towards the development of a theory of computation 
of the interleaving distance. As noted in Section P2.ll} to exploit the connection with 
multivariate quadratics in the development of such a theory in practice, we need in 
particular a way of computing minimal presentations of simplicial homology modules 
of simplicial n-filtrations. We hope to address this problem in future work. 
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Corollary 12.10.21 demonstrates that the interleaving distance is optimal in the 
sense of Example 12.9.21 when k = Q or Z/pZ. However, our discussion of optimality 
of pseudometrics in Section |2"1?1 raises many more questions than it answers. Some of 
the more interesting questions are: 

1 . Can we extend the result of Theorem 12.10.11 to arbitrary ground fields? 

2. Can we extend the result of Theorem 12. 10. II to the case i = 0? 

3. Can we prove that the interleaving distance is 7?.-optimal for TZ any of the 
relative structures on obj*(5 ri -mod) defined in Examples 12. 9. 5112.9. 81 / The case 
of Example l2.9.6l is of particular interest to us. We have observed in Section l2.9.3l 
that in this case an 7?.-optimal pseudometric does exist. 

4. Can we obtain analogous results about the optimality of pseudometrics on more 
general types of persistent homology modules? For instance, can we prove a 
result analogous to Theorem 12.10.11 for levelset zigzag persistence [5]? 

An interesting question related to question H] above is whether there is a way of 
algebraically reformulating the bottleneck distance for zigzag persistence modules as 
an analogue of the interleaving distance in such way that the definition generalizes to 
a larger classes of quiver representations [17] . 

It seems quite likely that Theorem 12.4. 1[ the algebraic stability result of [8] , gen- 
eralizes to a theorem which quantifies the similarity between the persistence diagrams 
of a pair of tame ( J\, ^-interleaved -Bi-persistence modules. It would be nice to have 
a generalized algebraic stability theorem of this kind. 

Finally, we mention again that it would be nice to have an extension of Theo- 
rem 12.3.61 to a structure theorem for arbitrary tame -Bi-persistence modules, and an 
extension of Corollary 12.5.21 to well behaved tame S n -persistence modules. 



Chapter 3 



Strong and Weak Interleavings on 
Multidimensional Filtrations 

In this chapter we introduce and study weak and strong interleavings and interleaving 
distances on multidimensional filtrations. See Section 11.41 for an overview of the 
chapter. 

3.1 w-filtrations and their Persistent Homology 

In this section we define w-filtrations, a mild generalization of the n-filtrations intro- 
duced in Section [2. 7. 21 which allow for the coordinates of points in the index set of the 
spaces in a multidimensional filtration to have finite upper bounds. We also gener- 
alize the multidimensional persistent homology functor, introduced in Section I2.7.3[ 
to a category whose objects are u-filtrations. These generalizations will be useful in 
formulating our inference results in Chapter 4. 

3.1.1 w-Filtrations 

Fix n G N. For u G M. n , we now define the category u-filt of w-filtrations. 

Define a u-filtration X to be a collection of topological spaces {X a } a<u , together 
with a collection of continuous functions {(f>x{ a -,b) : X a — » X5} a <5 <u such that if 
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a < b < c < u then <p x {b,c) o cj) X (a,b) = 0x(o,c). We call the functions {(fix(a, b) : 
X a — > X b } a < h<u transition maps. 

Note that an 66 n -filtration is the same as an n-filtration, as defined in Section l2.7.2l 
To avoid ambiguity, we stipulate that in this thesis, whenever n G N, by n-filtration 
we will always mean an 66„-filtration, and not a u-filtration with u = n. 

Given two u-filtrations X and Y, we define a morphism / from X to Y to be a 
collection of continuous functions {f a } a <u '■ X a — > Y a such that for all a < b < u, 
fb ° 4>x{a, b) = 0y(a, b) o f a . This definition of morphism gives the w-filtrations the 
structure of a category. Let w-filt denote this category. 

For all vl < u, we can define the restriction functor R u i : u-6.lt — > w'-filt in the 
obvious way. (The dependence of R u * on u is implicit in our notation.) We'll use 
these functors heavily in what follows. 

3.1.2 The Multidimensional Persistent Homology Functor on 
w-filtrations 

Here we define the multidimensional persistent homology functor Hi : it-fiTt — > B n - 
mod for any u e R n , extending the definition introduced in Section ^. 7.3l for u = oo n . 

As in Section [2.7.31 for a topological space X and i G Z> , let Ci(X) denote the 
jth srri g U i ar cri ain module of X, with coefficients in k; for X, Y topological spaces and 
/ : X — > Y a continuous map, let f # : Ci(X) — > Ci(Y) denote the map induced by /. 

For X a -u-filtration, define Cj(X), the i th singular chain module of X, to be the 
-B n -persistence module for which 

• Ci(X) a = Ci(X a ) for all a < u, 

• Ci(X) a = for all a -ft u, 

• ip Cl (x){a, b) = 4>*(a, b) for b < u, 

• (p Ci ( X ){a,b) = for b u. 

Note that for any i G Z>o, the collection of boundary maps {5i : Ci(X a ) — > 
Ci^i(X a )} a<u induces a boundary map Si : Ci(X) — > C.j_i(X). These boundary maps 
give {Ci(X)} ie z >0 the structure of a chain complex. We define the Hi(X), the i th 
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persistent homology module of X, to be the i homology module of this complex. 
For X and Y two w-filtrations, a morphism / G hom(X, Y) induces in the obvious 
way a morphism Hi(f) : Hi(X) — > Hi(Y), making Hi : ii-fi.lt — >■ -B n -mod a functor. 

Compatibility of Multidimensional Persistent Homology With Restriction 
Functors 

For all « G 1", we define the restriction functor R u : B n -mod — > B n -mod by taking 
R U (M) of a -B n -persistence module M to be such that 

• R u (M) a = M a for all a < u, 

• R u (M) a = for all a u, 

• ¥Ru(M)(a, b) = ip M {a, b) for b < u, 

• ¥Ru(M)(a, b) = for b jt u. 

We define the action of R u on morphisms in the obvious way. 

Remark 3.1.1. Note that if M is a _B n -persistence module having a finite presenta- 
tion (G\R) (32], then for any u G M n , R U (M) has a presentation consisting of at most 
\G\ generators and \R\ + n\G\ relations. 

Lemma 3.1.2 (Compatibility of persistent homology with restriction functors). For 

any v! < u G W 1 , 

Hi o R u i = R u , o Hi. 

Proof. The proof is easy; we omit it. □ 

3.2 Strong Interleavings and the Strong Interleav- 
ing Distance on i^-Filtrations 

Overview 

In this section, we introduce strong interleavings and the strong interleaving distance 
on w-filtrations, and develop their basic theory; in Section 13.31 we will introduce and 
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study our homotopy theoretic variants of these, weak interleavings and the weak 
interleaving distance. 

As we noted in the introduction, our inference results in Chapter 4 are formulated 
using weak interleavings and the weak interleaving distance. In this sense, weak 
interleavings and the weak interleaving distance are of more interest to us than strong 
interleavings and the strong interleaving distance. Nevertheless, because the program 
of understanding weak interleavings is, at least in a loose sense, related to the (simpler) 
program of understanding strong interleavings, we choose to devote some effort here to 
writing down some basic results about strong interleavings and the strong interleaving 
distance. 

To begin this section, we define shift functors and transition morphisms for cat- 
egories of M-filtrations. Using these, we define strong ( J\, ^-interleavings of in- 
filtrations and the strong interleaving distance; the definitions are similar to those 
we introduced for S n -persistence modules in Chapter 2. Whereas we first introduced 
e-interleavings of _B n -persistence modules in Section 12.1.51 and then presented the 
more general definition of strong (Ji, ^-interleavings of _B n -persistence modules in 
Section 12.6. H we proceed here directly with the general definition of strong interleav- 
ings of M-filtrations. 

We present a characterization result for strongly (J 1; J 2 )-interleaved pairs of in- 
filtrations, for an important class of u-filtrations. We also present two optimality 
results for the strong interleaving distance. These are analogues of our optimality 
result Theorem 12.10.11 for the interleaving distance on 5 n -persistence modules. 

3.2.1 Definitions of Strong Interleavings and the Strong In- 
terleaving Distance 

Shift Functors 

Fix it G M. n and let J : R n — > M. n be an order-preserving map. We define the J- 
shift functor (•)(/) : w-filt— > J _1 (ii)-filt in a way analogous to the way we did for 
-B n -persistence modules: 

1. Action of (-)(J) on objects: For X G obj (it-flit) , we define X(J) by taking 
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X(J) a = X J{a) for all a < J 1 (u). We take the maps {(f> X (j)(a, &)} a <b<j-i(«) to 
be induced by those of X in the obvious way. 

2. Action of {-)(J) on morphisms: For X, Y G obj(w-filt) and / G hom(X, Y~), we 
define f(J) : X( J) — >■ K(J) to be the morphism for which f(J) a = fj( a ) f° r au 
a < J~ l {u). 

Note that the dependence of the definition of the functor (•)(</) on the choice of 
u is implicit in our notation. 

As in the case of _B n -persistence modules, for e G K we let (-)( e ) denote the functor 



Transition Morphisms 

For a u-filtration X and J : R n -»■ lR n increasing, let 5(A, J) : J R J -i (tt) (X) ^ A(J), 
the J-transition morphism, be the morphism whose restriction to X a is the map 

4>x{^i J( a )) f° r ah a < J~ l (u). 

For e G M> , let 5(X, e) = S(X, J € ). 

The expected analogue of Lemma 12.6.11 holds, with the same proof. 
Lemma 3.2.1. 

(i) For any f : X — > Y G hom(u-filt) and any J : lR n — > 1R™ increasing, 



(We)- 



S(Y,J)of = f(J)oS(X, J). 



(ii) For any J, J' increasing and u-filtration X 



S(X, J')(J) o S(X, J) = S(X, f o 



J). 



(Hi) For any J, J' increasing and u-filtration X 



S(X, J')(J) = S(X(J), J- 1 ofo 



J). 
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Strong ( Ji, J 2 )-Interleavings of w-filtrations 

For Ji, J 2 : R™ — > M. n increasing, we say that a pair (X, Y) of w-filtrations is strongly 
{Ji, J 2 )-interleaved (or, more colloquially, that X and Y are strongly (Ji, J 2 )- 
interleaved) if there exist morphisms / : Rj-i^(X) — > Y(Ji) and g : Rj-\^(Y) — > 
X(J 2 ) such that 

g{Ji) ° Rj-^oj-\ u )U) = S(X, J 2 o J x ) and 
f(J2)oR J ^ oJr , {u) (g) = S(Y,J 1 oJ 2 ); 

we say (f,g) is a pair of strong (Ji, J2)-interleaving morphisms for (X, y). 

Remark 3.2.2. When discussing (J 1; J 2 )-interleavings and (Ji, J 2 )-interleaving mor- 
phisms of u-filtrations, we will follow terminological conventions analogous to those 
introduced for (Ji, J 2 )-hiterleaved £> n -persistence modules in Remark [2.6.21 

If for e > 0, a pair of w-filtrations is strongly (J e , J e )-interleaved, we say that they 
are strongly e-interleaved. 

Since the definition of strong (Ji, J 2 )-interleavings is a bit complex, the reader 
may find it illuminating to see the definition in the special case of e-interleavings: For 
e G [0, 00), two u-filtrations X and Y are strongly e-interleaved if and only if there 
exist morphisms / : R u ^ t (X) — > Y(e) and g : R u _ e (Y) — y X(e) such that 

g(e)oR u _ 2e (f) = S(X,2e)and 
f(e)oR u _ 2t (g) = S(Y,2e). 

When u = 6b n the definition of e-interleavings simplifies yet further to give a 
definition closely analogous to that of e-interleavings of ^-persistence modules: For 
e G [0, 00), two n-filtrations X and Y are strongly e-interleaved if and only if there 
exist morphisms / : X — > Y(e) and g : Y — y X(e) such that 

g(e)of = S(X,2e) and 
f(e)og = S(Y,2e). 
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The Strong Interleaving Distance 

We define d$i '■ obj* (u-fi.lt) x obj* (w-fi.lt) — > [0, oo], the strong interleaving dis- 
tance, by taking 

dsi(X,Y) = inf{e G R> |X and Y are strongly e-interleaved}. 

3.2.2 Stability Results for the Strong Interleaving distance 

dsi satisfies stability properties with respect to the functors F s , F so , F SCe analogous 
to the stability properties Theorems l2.8.Hl2lT3l satisfied by the interleaving distance 
on S n -persistence modules with respect to the functors Hi o F s , Hi o F so , and Hi o 
F SCe . The formulation and proofs of these results are easy modifications of those of 
Section EU 

3.2.3 First Examples of ( Ji, J2)- inter leaved Filtrations 
Example 3.2.3. 

(i) For any 5 > and {X, Y, d, f) G obj(C 5 °), F so {X, Y, d, f), F s °-°p{X, Y, d, f) 
are strongly 5-interleaved. 

(ii) For any 5 > and (X, Y, d, f) G obj(C SCe ), F sde (X, Y, d, /), F SCe -°P(X, Y, d, f) 
are strongly 5-interleaved. 

Example 3.2.4. For n G N, let id n : W l — > R n denote the identity map. 

Let J\ : R — > ]R be given by Ji(x) = 2x. If (Y, d) is any metric space, X C Y, 
F R (X,d) is the usual Rips filtration on (X,d), and F Ce (X, Y, d) is the usual Cech 
filtration on (X,Y,d), then the inclusions (11. ip in Section [1.5.31 immediately imply 
that F R (X,d),F de (X,Y,d) are ( Ji, idi)-interleaved. 

To generalize this example, for n G N, let J n +\ '■ M n+1 — > M n+1 be given by 
Jn+i((a, &)) — (a, 26) for all a G M™, 6 6 1. Then it again follows immediately 
from the inclusions (11. ip in Section 11.5.31 that for any (X, Y,d, 7) G obj(C 5C ' e ), 
F 5R (X,d,7),F 56e (X,r,d,7) are (J^+i, id n+ i)-interleaved. 
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3.2.4 A Characterization of Strongly Interleaved Pairs of Mod- 
ules of Nesting Type 

We would like to have a transparent characterization of strongly (Ji, ^-interleaved 
pairs of -u-filtrations, as Theorem 12.6.41 gives us for (Ji, ^-interleaved pairs of B n - 
persistence modules. 

It is not clear how obtain such a characterization in full generality; the obstacle 
to adapting the approach of Theorem 12.6. 41 is that, at least naively, in the category of 
w-filt there is no reasonably behaved notion of free covers, as we have in the category 
-B n -mod. 

However, we present a characterization result, Theorem 13 .2 .7} for strongly ( J 1; J 2 )- 
interleaved pairs u-filtrations, provided we restrict attention to the u-filtrations in a 
full subcategory w-nest of w-filt which is large enough to contain all of the examples of 
■u-filtrations that we have occasion to consider in this thesis. In the special case that 
u = 6b n , this characterization is very simple and transparent (Corollary I3.2.9p . For 
general u, the characterization is not as transparent, but it does imply a transparent 
necessary condition for the existence of a strong (Ji, ^-interleaving between two 
u-filtrations of nested type, Corollary I3.2.8[ similar to our necessary and sufficient 
condition for n-filtrations. 

u-filtrations of Nested Type 

We define w-nest to be the full subcategory of w-filt whose objects are nitrations X 
such that for all a < b < u, the transition map 4>x{a, b) is injective. We refer to the 
objects of M-filt as w-filtrations of nested type^J 

As we will now see, we can regard u-filtrations of nested type as an n-parameter 
family of nested subspaces of an ambient topological space; our characterization of 
strongly (Ji, ^-interleaved w-filtrations follows from the adoption of this viewpoint. 

1 Note that in the literature on persistent homology, nitrations of nested type are typically the 
only kinds of nitrations defined and studied; there is not a huge loss of generality in restricting 
attention to these. 
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Colimits of w-filtrations 

Let Top denote the category of topological spaces. 

Define the diagonal functor A : Top — > u-filt to act on objects W in Top by 
taking A(W) a = W for all a < u and 4>A(w)( a ib) = idw for all a < b < u; we define 
the action of A on hom(Top) in the obvious way. 

It is well known that the usual category of topological spaces has all small colimits 
[331 [20]. 111 particular, we can define the colimit of a w-filtration X. This is a topological 
space colim(X) together with a morphism s x : X — > A(colim(X)) satisfying the 
following universal property: If W is a topological space and s' : X — > A(W) is a 
morphism, then there is a unique map t : colim(X) — > W such that s' = A(t) o s x . 

For a u-filtration X colim(X) set, the quotient of U a<u X a by the relation 

~ generated by taking x ~ (fix (a, b)(x), for all a < b < u and x G X a . We define 
s x so that for each a < u, s x takes each element of X a to its equivalence class in 
colim(X). We take colim(X) to be endowed with the final topology. This means that 
a set U G colim(X) is open if and only if (s^) _1 (f/) is open for all a < u. 

We can, in the obvious way, also define define an action of colim(-) on hom(-u-filt) 
to obtain a functor colim : -u-filt— > Top. 

Lemma 3.2.5. If X is a u-filtration of nested type then s x is infective for all a < u. 
Proof. The proof is straightforward; we leave it to the reader. □ 

Colimit Representations of w-Filtrations of Nested Type 

Let Sub(w) = {y\y < u}. Let A u denote the set of subsets S of Sub(w) such that if 
y G S and y < y' < u then y' G S. 

Let CL U denote the category whose objects are pairs (W, f), where W is a topo- 
logical space and / : W — > A u is a function. For (Wi,/i), (W%, fz) G obj(CL u ), 
let hom((Wi, fi), (W 2 , f2)) denote continuous maps g : W\ — > W 2 such that for all 
y g Wi, h(y) c f 2 (g(y)). 

We'll define a functor A : w-filt— > CL U such that A restricts to an equivalence 
between w-nest and its image under A. 
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First, for X a -u-filtration, we define a function ROA x '■ colim(X) — > A u by 

ROAx(x) = {y\y < u and x lies in the image of Sy}. 

We say that ROAx{x) is the region of appearance of x in X. We define A by 
taking A(X) = (colim(X), ROA x ) for any -u-filtration X and A(f) = colim(/) for 
all / G hom(-u-filt). 

We can define a functor filt : CL U — > -u-nest by taking filt((W, f)) a = {y G W\a G 
f(y)} and endowing filt((W^, /)) a with the subspace topology. The action of filt on 
hom(CL n ) is defined in the obvious way. 

Lemma 3.2.6. The restriction of A to -u-nest is equivalence of categories between 
u-nest and the full subcategory CL' U of CL U whose set of objects is the image of 
obj (-u-nest) under A; the restriction o/filt to CL' U is an inverse of the restriction of 
A to -u-nest. 

Proof. This is straightforward; we leave the details to the reader. □ 

Characterization of Interleaved pairs of Filtrations of Nested Type 

Now we are ready to offer our characterization of (J±, ^-interleaved pairs of u- 
filtrations of nested type. Roughly, it says that -u-filtrations X, Y are ( J±, ^-interleaved 
if and only if there is a pair of inverse homeomorphisms between "large" subspaces of 
colim(X) and colim(y) which is compatible with the data of ROAx and ROAy in an 
appropriate sense, and additionally these homeomorphisms admit certain extensions. 

Note that if u G M n , u' < u, and X is a -u-filtration, then colim(i? u /(X)) can be 
identified with a subspace of colim(X). 

Theorem 3.2.7. For J 1? J 2 increasing, a pair (X, Y) of u-filtrations of nested type 
is ( Ji, J 2 ) -interleaved if and only if there are maps 

f : colim(i? J -i( u ^(X)) — > colim(F) and 
g : coUm( J R J -i (u) (y)) -)• colim(X) 
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such that 

1. f and g restrict to inverse homeomorphisms between 

co)1vol{Rj-i j-i {u) {X)) Up(colim( J R J - loJ -i (u) (y))) and 
co\im(Rj-i oJ -i( u) (Y)) U /(colim(^ JrloJ - 1(u) (X))), 

2. J^ROAxiy)) C ROA Y (f(y)) for all y G colim^-^X)), and 

3. J 2 (ROA Y (y)) C ROA x (g(y)) for all y G colim^-i^^)). 

Proof. The proof is a straightforward application of Lemma 13.2.61 we leave the un- 
winding of definitions to the reader. □ 

The following immediate consequence of the last theorem gives our transpar- 
ent necessary condition for the existence of a (J\, ^-interleaving between two u- 
filtrations. 

Corollary 3.2.8. If a pair (X, Y) of u-filtrations of nested type is (Ji, J 2 ) -interleaved 
then there are sets D X ,D Y with 

colim( J R J -i oJ - 1(u) (X)) CD X C colim(X), 
colim( J R J -i oJr i (u) (r)) cD Y C cohm(F), 

and a homeomorphism f : Dx — > D Y such that 

1. J x (ROA x {y)) C ROA Y (f(y)) for all y G D x , and 

2. J 2 (ROA Y (y)) C ROA x {r\y)) for all y G D Y . 

When u = 6b n , Theorem 13 . 2 . 71 reduces to the following: 

Corollary 3.2.9. A pair (X,Y) of n-filtrations of nested type is (Ji, J 2 ) -interleaved 
if and only if there is a homeomorphism f : colim(X) — >• colim(F), such that 

1. J^ROAxiy)) C ROA Y (f(y)) for all y G colim(X), and 

2. J 2 (ROA Y (y)) C ROAxif-^y)) for all y G colim(Y). 
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3.2.5 Optimality Properties of the Strong Interleaving Dis- 
tance 

Using Corollary I3.2.9[ we now observe that dsi is optimal is a sense analogous to 
which we have proven di to be optimal in Section 12.101 We also show that dj is 
7?.-optimal, for 1Z a relative structure defined in terms of dsi- 

The results here, while perhaps not especially surprising in light of the optimality 
results of Section I2.10[ are of significance in that they show that optimality theory 
for dsi behaves as one would hope, given those results. 

Theorem 3.2.10. Let TZ^ = (T,X-r, fr) be the relative structure on obj*(n-filt) (as 
defined in Section [Oj) such that T is the set of topological spaces, Xt = obj (C-f ) for 
all T eT, d T = d^, and fa is given by fr(T, g) = F S (T, g). Then dsi is 1Z±- optimal. 

Proof. We have observed above that dsi is 7^4-stable. By essentially the same argu- 
ment by which Theorem 12.10.11 follows from Proposition 12.10.41 to prove the result 
it's enough to show that if two n-filtrations X, Y 6 im(/7-) are strongly e-interleaved 
then there exists a topological space T and functions 7x,7v : T — > E n such that 
F s (T, lx ) = X, F 5 (T, 7y ) = Y, and sup, eT - f 2 (y)\\oo < e. This follows from 

Corollary [3X1 □ 

Theorem 3.2.11. For i 6 N, let IZ^^ be the relative structure on obj*(S„-mod) 
such that T is the singleton set s, X s = obj (n-filtj, d s = dsi, and f s is given by 
f s (X) = Hi(X). If k = Q or k = Z/pZ for some prime p then di is IZ*,^- optimal. 

Proof. Cleary di is l^j-stable. Optimality follows from Proposition 12.10.41 in essen- 
tially the same way that Theorem 12.10.11 follows from the same proposition. □ 
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3.3 Weak Interleavings and the Weak Interelaving 
Distance on w-Filtrations 

Overview 

We now turn to the definition and basic theory of weak interelavings and the weak 
interleaving distance. 

To begin, we prepare for the definition of weak interleavings by reviewing local- 
ization of categories and defining the localization of the category of -u-filtrations with 
respect to levelwise homotopy equivalences. We observe that the shift and restriction 
functors descend to the localized category. Using the descents of these functors, we 
define weak (J 1; J 2 )-interleavings and the weak interleaving distance. 

After proving some basic facts about weak interleavings, we show that persistent 
homology is stable with respect to the weak (Ji, J 2 )-interleavings on -u-filtrations and 
{J\, ^-interleavings on £?„-persistence modules. This result will be very useful to 
us for passing from our results about -u-filtrations to corresponding results about 
persistent homology modules. 

We finish the section by posing some questions about the weak interleaving dis- 
tance which we hope to answer in future work. 

3.3.1 Localization of Categories 

Localization of categories is a standard construction in homotopy theory. It is anal- 
ogous to the localization of rings and modules studied in commutative algebra. 

Definition 

Let C be a category and let W C hom(C) be a class of morphisms. A localization of 
C with respect to W is a category C^IU -1 ] together with a functor r : C — > ClW^ 1 ] 
such that 

(i) r(/) is an isomorphism for each / e W. 
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(ii) [universality] Whenever G : C — > D is a functor carrying elements of W to 
isomorphisms, there exists a unique functor G' : C[W _1 ] — > D such that G'oT = 
G. 

Condition (ii) guarantees that if T : C — > D and r' : C — > D' are two localizations of 
C with respect to W, then there is a unique isomorphism of categories O : D — > D' 
such that 9 o T = V. We will often denote CfP^ -1 ] as Ho(C), suppressing the 
dependence of this category on W. 

If C is a small category, meaning that obj(C) and hom(C) are each sets (as 
opposed to proper classes [31]), then a localization of C with respect to any set of 
morphisms W C hom(C) exists — see [Hi Section 42.4] for a simple proof of this, 
using the existence of pushouts in the category of small categories [33] . 

The localization of a category, when it exists, admits an explicit construction — see 
[T9[ Section 42.7]. From the form of this construction we obtain the following lemma. 

Lemma 3.3.1. For any category C, W C hom(C) there is a localization T : C — > 
C[W _1 ] such that obj(C[W /_1 ]) = obj(C) and hom(C[W / ~ 1 ]) is generated under com- 
position by the images under T of elements of hom(C) together with the inverses of 
the images under T of elements of W. 

The Connection Between Localization and Closed Model Categories 

Let W C hom(Top) denote the homotopy equivalences. Then it can be shown that 
TopJW -1 ] is isomorphic to the usual homotopy category of topological spaces (i.e. 
the category having the same objects as Top and homotopy classes of continuous 
maps as morphisms) [20] , 

In fact, there is a very general setting in which a localization can be constructed 
in a homotopy theoretic fashion — this is the setting of closed model categories, as 
introduced by the 1967 monograph of Daniel Quillen [37]. A closed model category 
is a category together with certain extra structure that allows for the axiomatic 
development of homotopy theory in that category. In more detail, a closed model 
category C is a category together with three distinguished classes of morphisms called 
weak equivalences, fibrations, and cofibrations. The category is required to have the 
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property that all finite limits and colimits exist [20], and the weak equivalences, 
fibrations, and cofibrations are required to satisfy certain conditions. Let W denote 
the weak equivalences of C. Quillen showed that the closed model structure on C 
allows for a homotopy theoretic construction of C[W -1 ] generalizing the construction 
of the usual homotopy category of topological spaces. For more details, see the 
expository articles [201 EE] . 

We will not need to appeal to the theory of closed model categories in this thesis, 
and indeed it is not clear that the particular localization we consider in our definition 
of weak interleavings admits an interpretation in terms of closed model categories. 
However, a very closely related notion of localization does admit such an interpreta- 
tion; see Remark l3.3.2l In any case, it may be useful for the reader to keep in mind the 
connection between localization and homotopy theory provided by Quillen's theory, 
even if we do not take advantage of it here. As the reader will see in Section I3.3.7[ 
there are important questions about weak interleavings which this thesis raises but 
does not answer, and it is quite plausible that closed model categories and axiomatic 
homotopy theory will be useful in addressing these questions. 

Localization of w-filt with Respect to Levelwise Homotopy Equivalences 

For any u G M™, let W C hom(tt-filt) denote the levelwise homotopy equivalences, the 
morphisms / for which f a is a homotopy equivalence for all a < u. In what follows, 
we'll work only with localizations of tt-fi.lt with respect to W. Thus from now on, 
Ho(u-filt) will always denote w-filt[W -1 ], and T : it-filt — > Ho{u-^\\i) will denote the 
localization functor; the dependence of V on u will be implicit. 

Remark 3.3.2. I am not aware of any way of endowing u-filt with the structure 
of a closed model category having W the class of weak equivalences. However, let 
W C hom(u-filt) denote the levelwise weak homotopy equivalences, the morphisms / 
for which f a is a weak homotopy equivalence [2H] for all a < u. It is known that w-fi.lt 
can be given the structure of a closed model category having W as the class of weak 
equivalences; see the discussion at the end of [201 Section 10] and the references given 
there. 
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3.3.2 Definitions of Weak Interleavings and the Weak Inter- 
leaving Distance 

Shift Functors and Restriction Functors in the Homotopy Category 

By the universal property of the localization of a category, for any u G R n and 
J : R" — > R™ order-preserving, the functor (•)(</) '■ M-filt — > J _1 (tt)-filt descends to a 
functor from Ho(u-G\t) to Ho^^ 1 (w)-filt). By abuse of notation, we also write this 
functor as (-)(J)- Note that 

ro(.)(j) = (.)(J)or. 

Similarly, by the universal property of the localization of a category, for any u' < 
u G R n the functor R u > : w-filt — >■ w'-filt descends to a functor from ifo(u-filt) to 
Ho(u'-filt). By abuse of notation, we also write this functor as R u >. Note that 

T o R u i = R u , o r. 

Commutativity in the Localized Category of Transition Morphisms with 
Arbitrary Morphisms 

The following is an analogue of Lemma [3.2. l( i) in the category ifo(u-filt). It is the 
key step in our proofs of Lemmas 13.3.61 and 13.3.71 below. 

Lemma 3.3.3. For any u G R n , u-filtrations X andY , morphism f G homHo(u-mt)(X,Y) , 
and J increasing, we have that 

f(J)oV(S(X, J)) = V(S(Y,J))of. 

Proof. By Lemma [3.3.11 we can write / = f\ o ... o f] where for each i, either /j = T(f) 
for some / G hom(C) or fa = r(/) _1 for some / G W. 

Note that f\J) — fi(J) o ... o fi(J). Thus to prove the result it's enough to check 
that for feW, 

r(/)- 1 ( J) o S(X, J) = S(Y, J) o T(f~)-\ 
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From the equation 



f(J)oS(X, J) = S(Y,J)of, 



we have that 



r(f(j))oT(s(x,j)) = r(s(Y,j))or(f). 



Composing both sides of the equation on the left by T(f) 1 (J) and on the right by 
r(/) _1 , we obtain that 



r(f)-\j) o r(s(x, J)) = t(S(y, J)) o r(/) 



-i 



as desired. 



□ 



Weak ( Ji, J 2 )-interleavings of u-filtrations 

For Jx, J 2 : R n — > W 1 increasing, we say that an ordered pair of M-filtrations (X, Y) is 
weakly (Ji, J2)-interleaved if there exist morphisms / : r(Rj-i, u JX)) — > T(Y(Ji)) 



we say (/, g) is a pair of weak (Jx, J2)-interleaving morphisms for (X, Y). 
For e > 0, we define weak e-interleavings in the expected way. 

The Weak Interleaving Distance on w-filtrations 

We define dwi '■ obj*(it-filt) xobj*(w-filt) — > [0, oo], the weak interleaving distance, 
by taking 



and g : r(Rj-i (u) (Y)) ->■ T(X(J 2 )) such that 



g{J x )oR 
f(J 2 )oR 



T ^j-Hu)(f) = ns(x,j 2 oj 1 )) 

^ j-i iu) (g)=T(S(Y 1 J 1 oJ 2 )); 



dwi(X,Y) 



inf{e G R>o|^ and Y are weakly e-interleaved}. 



It follows from Lemma l3.3.7( ii) below that dwi is a pseudometric on obj*(w-filt). 
It is easily checked that this pseudometric is not a metric. 
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3.3.3 Stability Results for the Weak Interleaving Distance 

dwi satisfies stability properties with respect to the functors F s , F so , F SR , and F SCe 
analogous to the stability properties Theorems 12.8. 1412.8.41 satisfied by the interleaving 
distance on _B n -persistence modules with respect to the functors Hi o F s , Hi o F so , 
Hi o F SR , and Hi o p SCe _ The formulation and proofs of these results are easy 
modifications of those of Theorems 12.8. 14l2T8~74l analogues of Theorems 12.8. 1412T8T51 
follow via Lemma f3 . 3 .41 below from the analogues of these theorems for dsi mentioned 
in Section 13.2.11 The analogue of Theorem 12.8.41 for dwi is proved in essentially the 
same way as Theorem 12.8.41 but requires Proposition |4.2.lT i). a lift of the persistent 
nerve theorem of [13] to the level of nitrations. 

3.3.4 Basic Results about Weak Interleavings 

Lemma 3.3.4. If u-filtrations X,Y are strongly (Ji, J 2 ) -interleaved then X,Y are 
weakly (Ji, J 2 ) -interleaved. 

Proof. It's easy to check that if f,g are strong ( Ji, ^-interleaving morphisms for 
X,Y then T(f),T(g) are weak (Ji, ^-interleaving morphisms for X,Y. We leave 
the details to the reader. □ 

Lemma 3.3.5. For any u' < u G M n and (J\, J 2) G W 1 , if u-filtrations X,Y are 
strongly (weakly) (Ji, J 2) -interleaved, then R U /(X), R u i{Y) are also strongly (weakly) 
(Ji, J 2) -interleaved. 

Proof. It is easy to check that if /, g are strong (weak) ( J 1; J 2 )-interleaving morphisms 
for X, Y, then R u '(f), R u '(g) are strong (weak) ( J 1; J 2 )-interleavings for R U /(X), R U >(Y) 

□ 

To prove our consistency results, we will require the following lemma. 

Lemma 3.3.6. For any u' < u G lR n and e > 0, if X,Y are u-filtrations such that 
R U /(X), R u i{Y) are strongly (weakly) e-interleaved then X,Y are strongly (weakly) 
(Je+u-u', Je+u-u>) -interleaved. 
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Proof. Let / : R u '-e(X) -> R u >(Y)(e) and g : i^QO -> i? M ,(X)(e) be strong e- 
interleaving homomorphisms for i? u /(X), Note that R u >(X)(e) = R u i- e (X(e)) 

and i2 u /(F)(e) = R u /_ t (Y(e)). Thus the homomorphisms S(Y(e),J u - u r) o f and 
5'(X(e), J u - U i) o g are well defined, and it's easy to check that these are strong 
(J e+U - U i, J e+u __ u /)-interleaving homomorphisms for X,Y. 

Invoking Lemma 13.3.31 the same argument also gives the result for weak inter- 
leavings. □ 

The next lemma will prove very useful to us. For u = (ux, u n ),u' = (u[, u' n ) G 
W 1 , let gcd(w, u') = (min(wi, u[), min(w n , u' n )). 

Lemma 3.3.7 (Triangle Inequality for (Jx, J 2 )-interleavings). 

(i) Suppose we are given u G W 1 , Jx, J2, J3, J4 : R n R™ increasing, Ux,u 2 < u, 
and u-filtrations X±, X 2 , X3 such that R Ul (X 1 ), R U1 (X 2 ) are strongly (weakly) 
(Ji, J 2) -interleaved and R U2 (X 2 ) , R U2 (X 3 ) are strongly (weakly) (J 3 , J 4 ) -interleaved. 
Then i? gc d( ul)U2 )(^i), -Rgcd(«i,« 2 )(^3) are strongly (weakly) (J$oJx, J 2 o J 4) -interleaved. 

(ii) In particular, if we have t\,t 2 > such that X\,X 2 are strongly (weakly) ex- 
interleaved and X 2 ,X 3 are strongly (weakly) e 2 -interleaved, then Xx,X 3 are 
strongly (weakly) (ex + e 2 ) -interleaved. 

Proof. By Lemma I3.3.5[ it suffices to assume that U\ = u 2 = u. It's easy to check 
that if fx, gi are strong (Jx, ^-interleaving homomorphisms for Xx, X 2 and f 2 , g 2 are 
strong (J 3 , J 4 )-interleaving homomorphisms for X 2 , X 3 then f 2 (Jx) fi,9i{Ji) °9i are 
strong (J 3 o Jx, J 2 o J 4 )-interleaving homomorphisms for X 1; X 3 . This gives (i) for 
strong interleavings. The case of weak interleavings follows via the same argument, 
using Lemma [3.3.31 □ 

We noted in Remark 12.1.11 that if e < e' G R and two _B n -persistence modules 
are e-interleaved, then they are e'-interleaved. Here is the analogous statement for 
(■A, ^-interleavings, on the level of nitrations. (The same proof gives corresponding 
result for (Jx, ^-interleaved -B n -persistence modules.) 

For J, J' : W 1 -> W 1 bijections, say that J < J' if J(a) < J'(a) for all a G W 1 . 
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Lemma 3.3.8. Let J\, J[, J 2 , J 2 be increasing and J x < J[, J 2 < J' 2 . If u-filtrations 
X, Y are strongly (weakly) ( J 1; J 2 ) -interleaved then X, Y are strongly (weakly) (J[, J 2 )- 
interleaved. 

Proof. Since J\, and J[ are increasing, there is an increasing map J" : R n — > W 1 such 
that J[ = J" o J x . Similarly, there is an increasing map J' 2 ' : W 1 — > M. n such that 
J 2 = J'2 o J 2 . Let f,g be strong (Ji, J 2 )-interleaving morphisms for X, Y. We show 
that S(Y, J'{){J\) /, S(X, J 2 )(J2) 9 are strong (J{, J^-interleaving momorphisms 
for X, Y. By Lemma E2U 

S(X, J2){J 2 )(J[) o g(J[) o S(Y, J1){J X ) o f 
= S(X, 4){J 2 o J[) o g {j' x ) o 5(y(Ji), Jf 1 oJ[)of 
= S(X, J 2 ')(J 2 o J() o & (J() o /(Jf 1 o JO o 5(X, Jf 1 o J() 
= S(X, J^)(J 2 o J{) o S(X, J 2 o J 1 )(Jf 1 o J() o S(X, Jf 1 o JQ 

= s(x, j 2 )(j 2 o jo o s(x, j 2 oj lQ j- 1 o jo 

= S(X, J' 2 oJ 2 o JO 

= s(x,j^o jo. 

The symmetric argument shows that 

S(Y, J'OiWz) o /(JO o S(X, 4){J 2 ) o 5 = S(Y, J[ o JO. 

This gives the result for strong interleavings; the result follows for weak interleavings 
by the same argument, using Lemma [3.3.31 □ 

3.3.5 Stability of Persistent Homology with Respect to In- 
terleavings 

In this section, we prove that persistent homology is stable with respect to weak 
interleavings on w-filtrations and interleavings on B n -persistence modules. 

Theorem 3.3.9. For any i G Z> , u E W 1 , and Ji, J 2 : M" — > M™ increasing, 
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if u-filtrations X,Y are weakly (J 1; J 2 ) -interleaved then Hi(X),Hi(Y) are (J\, J2)- 
interleaved. 

Corollary 3.3.10. For any i G Z> 0; u G W 1 , and u-filtrations X and Y , 

d^H^X^Hity)) <d WI (X,Y). 

Proof of Theorem \3. 3.9l We begin with a couple of lemmas. 
Lemma 3.3.11. For any i G Z>o and J : W n — > IR n order-preserving, 

H t o(.)(J) = (.)(J)oH l . 

Proof. We leave the straightforward proof to the reader. □ 

Note that if / G hom(5 n -mod) is such that f a is an isomorphism for all a G M n , 
then / is an isomorphism. Thus, since the singular homology functor maps homotopy 
equivalences to isomorphisms, Hi takes elements of W to isomorphisms in B n -mod. 
Then by the universal property of localizations, for any u G W 1 , there's a functor 
Hi : Ho(u-6\t) — ¥ 5 n -mod such that 

H^HiO r. 

Lemma 3.3.12. For any i G Z> and J increasing, 

H i o(-)(J) = (.)(J)oH i . 

Proof. Hi o (-)(J) takes elements of W to isomorphisms in B n -mod. Thus by the 
universal property of localization there exists a unique functor G : Ho(u-61t) —¥ Bn- 
mod such that G o T = Hi o (•)( J). We have that 

Hi o (-)(J) o T = Hi o T o (-)(J) = Hi o (-)(J) = (.)( J) o = («)(J) o F 4 o r. 



Thus by the uniqueness property of G, Hi o (•)(</) = G — (-)(J) o ifj. 



□ 
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Now let / : r(i2^-i (lt) (X)) -> IWO) and <? : r( J R J - 1(u) (F)) T(X(J 2 )) be 
weak (Ji, J 2 )-inter leaving morphisms for X, y. 
The homomorphisms 

Hi(g) ■ Hi(Rj-i^(Y)) — »• Hi(X){J 2 ) 

extend to homomorphisms f E : Hi(X) — )■ Hi(Y)(Ji) and : H^Y) — > H,i(X)(J 2 ) by 
taking to be identically zero on all homogenous summands Hi(X) a of Hi(X) such 
that a 5^ J-f and taking gf^ to be identically zero on all homogenous summands 
Hi(Y) a of Hi(Y) such that a ft J 2 l (u). 

We'll now show that f E , are ( Ji, J 2 )-interleaving homomorphisms for Hi(X), Hi(Y). 
We have that 

BifoXJO o ^i(i2^-i OJr x (u) (/)) = fl,(y(Ji)) o ^( J R Ji - loJ - 1(M) (/)) 
= HiGy(Ji) o % loJ - 1(u) (/)) = o T(S(X, J 2 o J x )) 
= Hi(S(X, J 2 o JO) = S(Hi(X), J 2 o Ji). 

This implies that for a < J-f 1 o J 2 ~ 1 (u), 

(g E (Ji) o f E ) a = (Hi(g)(Ji) o H t (Rj-r OJ - 1{u) (f))) a = smx), J 2 O Ji).. 

On the other hand, for a -ft Jf o (iz), 

o = = J 2 o J x ) a . 



o 



Thus, Qe(Ji) ° /e = S(Hi(X), J 2 o J\). The symmetric argument shows that /e(J 2 
fl'E = S(Hi(Y), J\ o J 2 ). This shows that f E ,g E are ( Ji, J 2 ) -interleaving homomor- 
phisms for Hi(X), Hi(Y), as desired. □ 

Example 3.3.13. By Example 13.2.41 and Theorem 13.3.91 we have that for any i > 
and (X, Y, d, 7) G obj(C SCe ), the ^-persistence modules HioF R (X, d),H i oF C!e {X, Y, d) 
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are (j7i, idi)-interleaved, and the 5 n+ i-persistence modules Hi o F SR (X,d, r y), Hi o 
F SCe (X, Y, d, 7) are (J n +i, id n+ i)-interleaved. 

3.3.6 On Our Choice of Definition of Weak Interleavings 

To define weak interleavings, we have considered the localization of -u-filt with respect 
to levelwise weak equivalences. An alternative approach, and one worth understand- 
ing because it is very simple, is to define weak interleavings by passing to a category of 
nitrations over the homotopy category of topological spaces. We take a moment here 
to explain this alternative approach to defining weak interleavings and the weak in- 
terleaving distance and to discuss the relationship between the distance thus obtained 
and dwi- 

For u G W 1 , let -u-filt* be the category whose objects are collections of topological 
spaces {X a } a<u together with morphisms {(p x (a,b) G hom jf/o(Top) (X a , X b )} a < b<u such 
that if a < b < c < u then (f>x(b, c) o (fix(a, b) = (px(a, c). 

Given two objects X and Y of -u-filt*, we define a morphism / G hom u _ fi i t * (X, Y) to 
be a collection of morphisms {f a G hom^ ( Top )(X a , Y a )} a<u such that for all a < b < u, 
f b o (j) X (a,b) = (j) Y (a,b) o f a . 

The usual functor T' : Top — > ifo(Top) induces a functor T' : -u-filt—)- M-filt*. 
This functor takes levelwise weak equivalences to isomorphisms. Further, we can 
define shift and restriction functors on the categories -u-filt* just as we have for the 
categories u-filt. Thus for any u G W 1 we have all the structure we need to formulate 
an alternative definition of weak (Ji, J 2 )-interleavings of -u-filtrations via the functor 
T' analogous to the one we have formulated via the localization functor T. We'll call 
the weak interleavings thus defined A-weak interleavings. The definition of A-weak 
e-interleavings induces a definition of an A-weak interleaving distance on u-filtrations 
which we denote d^wi- 

By the universal property of localization, we have a functor : if o(-u-filt)— > -u-filt* 
such that r" = o T. It follows that if two nitrations are weakly (J 1; J 2 )-interleaved, 
then they are A-weakly (J 1? J 2 )-interleaved, and in particular, d^wi < dwi- This 
implies that all of our inference results in this thesis which are formulated in terms of 
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weak interleavings are still true if instead formulated in terms of A-weak interleavings. 

We would like the distance with which we develop our inferential theory to be 
as sensitive as possible, subject to the conditions that it has reasonable stability 
properties (such as those mentioned in Section 13.3.21) and that the distance between 
nitrations X and Y is whenever there exists a levelwise homotopy equivalence 
/ : X — > Y. Thus the fact that d^wi — dwi offers some justification for our working 
with dwi rather than d^wi- 

I do not yet know if it is in fact true that dAwi — dwi, though I suspect that this 
equality does not hold in general. More generally, we can ask the following: Under 
what circumstances does the existence of an A-weak (Ji, ^-interleaving between 
two w-filtrations imply the existence of a weak (Ji, J 2 )-interleaving between the two 
nitrations? This question is closely related to the problem of understanding the 
relationship between a homotopy category of diagrams of topological spaces of a 
given shape and the category of diagrams of the same shape over the homotopy 
category of topological spaces. It is well known that these two categories are usually 
not equivalent [201 Remark 10.3], and in fact there are results in the homotopy theory 
literature which quantify the difference between the two categories [15]. We leave it 
to the future to study how those results bear on the question of when dAwi — dwi- 

3.3.7 The Theory of Weak Interleavings and the Optimality 
of dwi- Open Questions 

To close this section, we present some questions regarding theoretical properties of 
weak interleavings and the interleaving distance. Answers to these questions would, 
it seems, offer us a satisfactory understanding of weak interleavings and the weak 
interleaving distance. 

1. Can we use homotopy colimits to give an analogue for weak interleavings of our 
characterization Theorem 13.2.71 .'' 

2. In particular, can we give a transparent topological characterization of weakly 
0-interleaved pairs of nitrations of nested type analogous to the characterization 
[29l Corollary 0.21] of pairs of homotopy equivalent topological spaces as pairs 
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of deformation retracts of a common embedding space? 

3. If w-filtrations X,Y are weakly (J 1; J 2 )-interleaved, do there exist strongly 
(Ji, J 2 )-interleaved infiltrations X',Y' with d WI (X,X') = d WI {Y,Y') = 0? 

4. Is dwi optimal in the sense that d < dwi for any pseudometric d on obj (it-fi.lt) 
such that d < d$i and d(X, Y) — whenever X, Y are weakly 0-interleaved? A 
positive answer to this question would provide strong justification for the use 
of weak interleavings in the development of the theory of topological inference. 
Note that a positive answer to the last question would imply a positive answer 
to this question. 

5. A question already raised in Section 13.3.61 Under what circumstances does the 
existence of an A- weak ( J 1; J 2 )-interleaving for a pair of it-filtrations imply the 
existence of a weak (J\, ^-interleaving for the pair? 

6. Can an arbitrary it-filtration be well approximated (with respect to dwi) by a 
M-filtration of nesting type (e.g. by mapping telescope type constructions)? 



Chapter 4 



Approximation and Inference 
Results for Multidimensional 
Filtrations 

In this chapter we apply the interleaving machinery introduced and studied in the 
previous two chapters to formulate and prove topological inference results for multi- 
dimensional filtrations. See Section 11.51 for an overview of the chapter. 

4.1 Inference Preliminaries 
4.1.1 Basic Notation 

Thoughtout this chapter, fix p G [1, oo] and m G N. Let d p denote the L p metric on 
R m . 

For (X, d) a metric space and X' C X, we'll often abuse notation slightly and let 
d also denote the restriction of d to X' . 

If (X, d) is a metric space, x G X, and r G M>o, we let B d (x, r) denote the closed 
metric ball of radius r centered at x. That is, Bd(x,r) = {x' G X\d(x,x') < r}. 

Let (X, d) be a metric space, and S C X. We say a set L C X is an e-sample of 
S (w.r.t d) if for any s G S, there exists some / G L such that c?(s, I) < e. 
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If (X, dx) and (Y, dy) are metric spaces, a function 7 : X — > Y is said to be 
c-Lipschitz if for all Xi,x 2 G X, d Y (7(0:1), 7(^2)) < c • dxixi-,^2)- In this thesis, the 
codomain (Y, dy) of a Lipschitz function will always be (M. n , d°°) for some n. When 
we want to make explicit the metric d on the domain of a Lipchitz function /, we will 
refer to / as a c-Lipchitz function (w.r.t. d). 

4.1.2 Riemannian Manifolds and Probability Density Func- 
tions 

In this thesis Riemannian manifolds will always be understood to be manifolds with 
boundary. 

Let M be a Riemannian manifold of dimension I. The Riemannian structure on M 
induces a metric d M on M, the geodesic metric. In turn, the geodesic metric induces 
a measure % l M on (M,Bm), the I -dimensional Hausdorff measure (Here Bm is 
the Borel a-algebra of M). From now on we'll write % l M as Hm] I will be implicit in 
this notation. 

When M = R m , endowed with the standard Euclidean metric, T-Lm is the usual 
Lebesgue measure on R n . 

For y G M and r > 0, we say a ball B = B d M(y, r) is strongly convex if for every 
pair of points y',y" in the closure of B, there exists a unique shortest path in M 
between y' and y", and the interior of this path is included in B. Define p(M), the 
strong convexity radius of M, by 

p(M) = inf sup{r\B d Ai(y,r) is strongly convex} 

yeM r>0 

As noted in [llj, p(M) is positive when M is compact. When M is a Euclidean space, 
p(M) = 00. 

A density function on M is a i^M-measurable function 7 : M — > [0, 00) such that 
J M 7 dT^A/ = 1. A density function 7 defines a probability measure P 7 on M with the 
property that for any A G £? M , ^(A) = J A 'jd'HM- 
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4.1.3 Density Estimators 

As we explained in the introduction, the superlevelset-Rips nitrations we consider in 
the formation of our consistency results are filtered by density estimators. We review 
here some basic concepts and results related to density estimation that we will need 
in what follows. Some of the material here also appeared in Section ll.2.2( we include 
that material again here for readability's sake. 

Let M be a Riemannian manifold and let T>{M) be the set of density functions 
on M. We'll define a density estimator E on M to be a sequence of functions {E z : 
M z -> V(M)} zeN such that for any y £ M, the function E g>y : M z — >■ K defined by 
E z , y {T) = (E z (T))(y) is measurable. By slight abuse of terminology, we'll also refer 
to the individual functions E z as density estimators. 

We formulate our results in terms of pairs (E, 7) of density estimators and density 
functions satisfying one of two properties. Let T z be a random sample of P 7 of size 
z. The first property is: 

Al: E Z (T Z ) converges uniformly in probability to 7. 

In stating the second property we'll assume that (E, 7) is defined on R m for some 
m G N. The property is: 

A2: E Z (T Z ) converges uniformly in probability to the convolution of 7 with some 
kernel function K. 

Al is known to hold for kernel density estimators on Euclidean spaces, for a wide 
class of kernels and density functions 7, provided the kernel width tends to at an 
appropriate rate as z — > 00 [27]. Further, Pelletier has shown that the notion of 
kernel density estimators extends to Riemannian manifolds [3B|, and a recent article 
by Henry and Rodriguez [30] shows that under mild assumptions on (E, 7) and a 
similar condition on the rate at which the bandwidth of the kernel tends to as 
z — > 00, assumption Al holds for the estimators defined by Pelletier. In fact, each of 
the cited results gives a.s. uniform convergence of E Z (T Z ) to 7. 

A2 also is known to hold for kernel density estimators E with kernel K on Eu- 
clidean spaces, for a wide class of kernels K and density functions 7, when the band- 
width of the estimator E z is held fixed as z varies [38l Proposition 9]. 
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Let 7 : M — > R be a density function, 7 : M — > R be another density function, 
g G (0, 1], C > 0, z G N, and E z : M z ^ V(M) be a density estimator. We'll say 
that E z is a (g, C)-density estimator of 7 w.r.t. V 1 if for T 2 a finite i.i.d. random 
sample of P 7 of size z, 

P(sup\E z (T z )(l)-^(l)\>C)<q. 

When E z is a (g, C)-density estimator of 7 w.r.t. "P 7 , we'll say simply that E z is 
a (g, C)-density estimator of 7. 

The two cases of interest to us will be where, in the above definition, 7 = 7 and, 
in the special case that M = R m , 7 = 7 * K for a kernel K. 

Lemma 4.1.1. 

(i) If M is a Riemannian manifold, 7 : M — > R is a density function, and E is 
density estimator on M such that (E,j) satisfies Al, then for any C > and 
q G (0, 1] there exists z such that for all z > z , E z is a (q, C)-density estimator 
ofl- 

(ii) Ifj: R m — > R is a density function and E is density estimator on R m such that 
(E,j) satisfies A2 for some kernel K, then for any C G (0, 00) and q G (0, 1] 
there exists z such that for all z > z , E z is a (q,C)-density estimator of 
7 * K w.r.t. "P 7 . 

Proof. This is immediate from the definitions. □ 

4.2 Deterministic Approximation of Multidimen- 
sional Filtrations via Discrete Filtrations 

Overview 

In this section, we prove two theorems concerning the deterministic topological ap- 
proximation of sublevelset-offset filtrations via discrete filtrations. The first theorem 
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concerns approximation by sublevelset-Cech nitrations; the second theorem concerns 
approximation by sublevelset-Rips nitrations. 

These results are multidimensional analogues of [121 Theorem 3.1] and its exten- 
sion [HI Theorem 4.5] to good samplings of sublevelsets of 7. They hold on the level 
of nitrations rather than merely on the level of persistent homology modules. By 
Theorem 13. 3. 9} we also obtain analogues of these results on the level of persistent 
homology modules. 

In addition to our results on the topological approximation of sublevelset-offset 
nitrations, we present results on the deterministic approximation of multidimensional 
sublevelset nitrations and their persistent homology via Cech and Rips complexes 
with fixed scale parameter. 

4.2.1 Weak O-interleavings of Open Sublevelset-offset and Open 
Sublevelset-Cech Filtrations 

To prove our approximation results, we need the following proposition, which of- 
fers some motivation for our definition of weak interleavings of u-filtrations; see Re- 
mark 14.2.21 below. 

Proposition 4.2.1. 

(%) For any X C R m and 7 : X -»■ R n , R^^ o F so ~° p (X, R m , <P, 7) and Ro6 n+1 o 

F SCe ~° p (X, R m , d p , 7) are weakly O-interleaved. 
(ii) Let M be a Riemannian manifold. For any X C M and 7 : X — > R n , 
R{db n AM)) F so ~° p (X, M, d M , 7) and R^ nA M)) ° F SCe ~° p (X, M, d M , 7) are 
weakly O-interleaved. 

Note that in (ii), if p(M) = then all spaces of each of the two filtrations are the 
empty set, so in this case (ii) holds vacuously. 

Proof. We give the proof for (ii); the proof of (i) is the same. 

The same argument that Chazal and Oudot use to prove the persistent nerve 
lemma [13] gives us that there exists an (66 n , p(M))-filtration Z (a "Mayer- Vietoris 
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blowup nitration," to borrow the terminology of |47]) such that there are level- 
wise homotopy equivalences / : Z — » R(6b n , p (M)) ° F so ~° p (X, M, d M , 7), and g : 
J R(o6„,p( A /))0^ F SCe -°P(X, M, d M , 7) in hom((oo n , p(M))-filt). Thus ToR {o&nAM)) o 
F S0 ~ 0p (X,M,d M ,-?) and T o R {An AM )) o F SCe ~° p (X, M, d M , 7) are isomorphic in 
Ho((6b n , p(M))-filt), and hence are weakly O-interleaved. □ 

Remark 4.2.2. Note that for M,X, and 7 defined as in the statement of Proposi- 
tion EOTEIii), it needn't be true that R {o6nAM ))oF so -° p (X, M, d M , 7) and R io 6 n , P (M))° 
F e P (X, M, d M , 7) are strongly O-interleaved. For example, Let M = [0,1] (en- 
dowed with the Euclidean metric), X = {0,1}, and define 7 : {0, 1} — > E by 
7(0) = 7(1) = 0. Then p(M) = 00. It's straightforward to check that there can 
be no pair of strong O-interleaving homomorphisms between F P (X, M, d M , 7) 
and F SCe -° p (X, M, d M ', 7). 

Nevertheless, it is natural to think of R^ n AM ^oF so ~° p (X 1 M, d M , 7) and R(6b n ,p(M))° 
F SCe ~° p (X, M, d M , 7) as being "topologically equivalent." This motivates a choice of 
pseudometric on n-filtrations with respect to which w-filtrations which are isomorphic 
in ifo(u-filt) are distance from one another. 

4.2.2 Topological Approximation of Sub levelset- Offset Fil- 
iations via Sublevelset-Cech Filtrations 

We state our first approximation result, Theorem 14.2 .3[ in two parts. It is easily seen 
that each is a special case of a more general result, but the fully general form of the 
result is not particularly interesting and it seems more expedient to just state the two 
special cases of interest separately. 

Theorem 14.2.3( 1) says that for any W C R m , Lipchitz function 7 : W — > W 1 , 
T C W and 7 : T — > M n , F SCe (T, M m , d p , 7) gives a good topological approximation 
to F s (W, M. m , d p , 7) when the Hausdorff distance between T and W is small and 
sup; GT ||7(0 — 7(011 is a l so small. 

In fact, Theorem I4.2.3( i) says more generally that for any u G M. n , R u ,oo 
F SCe (T, R m , d p , 7) gives a good topological approximation to R Ut00 oF so (W, R m , d p , 7) 
when the Hausdorff distance between T fl 7„ and 7„ is small and sup igT — 7(011 
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is also small. 

Theorem 14.2.3( h) gives a variant of this result for Riemannian manifolds. 
Theorem 4.2.3. 

(i) Let W be a subset ofR m , let 7 : W — )■ M n be a c-Lipschitz function (w.r.t 
d p ) for some c > 0, let d = max(l,c), and let u G R n . Let T C W be a 4- 
sample of 7„ (w.r.t d p ). Let C G M>o and 7 : T — > ¥L n be a function such that 

~ 7(011°° < C f° r al1 1 e T - Then 

dwi(R(u,oo) o F so (W, M m , d p } 7), i? (U|0o) o F sde (T, R m , d p , 7)) < e + C. 

(iij Lei M be a Riemannian manifold and let 7 : M — )■ M n 6e a c-Lipschitz function 
(w.r.t d M ) for some c > 0, let d = max(l,c), and let u G R n . Let T C M be 
a 4 -sample of 7„ (w.r.t d M ). Le£ C G M>o and 7 : T — >■ M n fre a function such 
that ||7(/) - 7(7) Hoc < C /or a// I G T. I%en, 

dwi(R{u, P (M)) ° F so (M, d M , 7), %, P{ M)) o F sde (T, M, d M , 7)) < e + C. 

Proof. We present the proof of (i); the proof of (ii) is essentially the same as the proof 
of (i), using Proposition 14.2. lT ii) in place of Proposition 14.2. lT i) . 

We begin with a lemma. Let 7r : T — > R n denote the restriction of 7 to T. 

Lemma 4.2.4. R {u>oo) oF so {W, R m , d p , 7) and R {Ut0o) oF so {T, R m , d p , 7 T ) are strongly 
e-interleaved. 

Proof. For any (a, b) 6R"xl, let 7 ajb denote F so (W, R m , gP>, 7)^ and let A a , 6 denote 
F so (T, R m , d p , 7r)(o,6)- Consider some (a, 6) < (w, 00) and p G 7 0i &. There is a point 
p' G 7a with d p (p,p') < b. Since T is an 4-sample of 7 a , there is a point p" G T with 
d p {p",p') < Thus d p {p",p) <^ + b. Since 7 is c-Lipschitz, 7 (p") <a+f<a + e. 
Hence p G A a+e0+ 4. Therefore 7 a & C \ a + e b+A- C \ a +eb+t- The inclusions thus define 
a morphism / : % j0o) o F so (W, R m , d?, 7) R {u ,oo) ° ^ SO (T, M m , d?, Tr )(e). 
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Now let p G X a> b- Then there is a point p'eT with d p (p,p') < b and < a. 
Thus p G 7 aj fe. Therefore X a ^ C 7 ai t C 7 a+<E ,f>+ e . The inclusions thus define a morphism 
<7 : % i00) o F so (T, R"\ 7T ) -> i? (Ui00) o F so (^, M m , d", 7 )(e). 

Clearly, / and g are a pair of strong interleaving e-interleaving homomorphisms. 
Thus R {Ut0o) oF so {W, R m , d p , 7) and R (u>oo) oF so (T, R m , dP, j T ) are strongly e-interleaved, 
as desired. □ 

We now observe that we have a chain of interleaving relationships between nitra- 
tions: 

• LemmaELltells us that R {Ui0o) oF so {W, R m , d?, 7) and R (u>oo) oF so (T, R m , dP, j T ) 
are strongly e-interleaved. 

• By Example E2M}), for any 5 > 0, F so {T, R m , d p , j T ) and F S °-° P (T, R m , d p , j T ) 
are strongly 5-interleaved. 

• By Proposition gXDTj, F S °-° P (T, R m , d p , j T ) and F SCe ~ 0p (T, R m , d p , j T ) are 
weakly 0-interleaved. 

• By Example E23(ii), F SCe -° p (T, R m , dP, jr) and F s0e (T, R m , d p , j T ) are strongly 
5-interleaved. 

• F sGe (T,R m ,d p ^ T ) and F sGe (T, R m , d p , 7) are strongly C-interleaved. 

Applying Lemmas 13 . 3 . 41 and 13 .3.71 several times gives us that R( u ,oo)°F (W, R m , d p , 7) 
and R (Ut0o) o F sGe (T, R m , d p , 7) are (e + C + 25)-interleaved. Since this holds for all 
5 > 0, 

dwi(R(u,oo) o F so (W, R m , d p , 7), R M o F SR (T, R m , dP, 7)) < e + C, 

as we wanted to show. □ 

Remark 4.2.5. Theorem 14.2.31 can be strengthened somewhat using the language of 
{J\i </2)-interleavings, but the strengthening is not especially interesting, so we choose 
to frame the result using the simpler language of e-interleavings. 

Corollary 4.2.6. For any i > 0, 
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(i) under the same assumptions as in Theorem 4-2. 3{ i), 



d^Hi o R {U)00) o F so (W, R m , dP, j),H i0 R [u>oo) o F SCe (T, R m , d?, j)) < e + C. 



(ii) under the same assumptions as in Theorem 4.2.3 (H), 



d^Hi o R MM )) o F so (M, d M , 7), Hi o R(u, p (M)) o F SCe (T, M, d M , 7)) < e + C. 

Proof. This follows immediately from Theorem 13.3.91 □ 

4.2.3 Topological Approximation of Sub levelset- Offset Fil- 
iations via Sublevelset-Rips Filtrations 

Our second main result of this section, Theorem l4.2.71 is an analogue of Theorem l4.2.3l 
for sublevelset-Rips filtrations rather than sub levelset- Cech filtrations. We formulate 
the result using weak ( J 1; J 2 )-i n terleavings. 

For (Ji, J2) increasing, we'll say that two w-filtrations X and Y are strongly 
(weakly) almost (Ji, J2)-interleaved if for all e > 0, X and Y are strongly (weakly) 
(J e o J 1; J e o J 2 )-interleaved. Similarly, we'll say that two -B„-persistence modules M 
and N are almost ( Ji, ^-interleaved if for all e > 0, M and N are ( J e o J 1; J e o J 2 )- 
interleaved. 

For the statement theorem, recall that in Example 13.2.41 we defined the map 
J n+1 : M rt+1 M rt+1 by J n+1 ((a, b)) = (a, 2b) for a e M n , b G R. 

Theorem 4.2.7. 

(i) Under the same assumptions as in Theorem \4-2. 3( %), R( U) oo) °F so (W, R m , d p , 7) 
and R( u ,oo) F SR (T, d p , 7) are almost weakly (J e +c, Jn+i J e +c) -interleaved. 

(ii) Under the same assumptions as in Theorem \4 . 2. 3\ ii ), R\ u ,p(M)) °F so (M, d , 7) 
and R(u, p (m)) F SR (T, d M , 7) are almost weakly (J e +c, Jn+i Je+c) -interleaved. 

Proof. We present the proof of (i); the proof of (ii) is essentially the same as the proof 
of (i), using Theorem 14.2 .3( ii) in place of Theorem 14.2. 3( i). 
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By Theorem l2X3T i). for any 5 > R (u>oo) o F so {W,R m ,d p ,-f) and i2 (lt)0o) o 
F sGe (T, R m , d p , 7) are (e + C + <5)-interleaved. By Example BXjl F sCe (T, R m , d p , 7) 
and F SR (T, d p , 7) are (id n+ i, J7n+i) -interleaved. Thus by Lemmas 13.3.51 and 13.3.71 
R(u,oo) o F 5 °(iy,R m ,rfP, 7 ) and R (u>oo) o F 5/? (T, cP>, 7) are ( J e+c+< 5, Jn+i ° Je+c+a)- 
interleaved. Thus by Lemma 13.3.81 they are also ( J25 Je+c, J25 Jn+i ^e+c)- 
interleaved. The result follows. □ 

Remark 4.2.8. Theorem 14.2.71 can be tightened by tightening the result of Theo- 
rem 14.2.31 on which it depends; see Remark 14.2.51 

Corollary 4.2.9. For any i > 0, 



(i) under the same assumptions as in Theorem \4 .2.3 (i), HioR^ u oo ^oF so (W, M m , d p , 7) 
and Hi o i?( njOC ) o F SR (T, d p , 7) are almost (J e+ c, Jn+i J t +c)-'>' n t er l eave d. 



(ii) under the same assumptions as in Theorem \4 . 2. 'J( ii ), HioR^ u p ^ M ^oF so (M, d , 7) 
and Hi o R(u, p (m)) F (T, d M , 7) are almost (J e +c, Jn+i Je+c) -interleaved. 

4.2 .4 Approximating Multidimensional Sublevelset Persistence 
via Discrete Filtrations with Fixed Scale Parameter 

We now observe that the deterministic result [T2"| Theorem 3.7], on the approximation 
of sublevelset persistence of R- valued functions via filtered Rips complexes with fixed 
scale parameter, admits a straightforward generalization R n - valued functions, n > 1. 
This generalization, Theorem 14.2 .10[ is proven in the same way as [121 Theorem 3.7], 
using the interleaving distance in place of the bottleneck distance. 

Unlike the results of the previous sections, this result gives an approximation only 
on the level of persistent homology and not one on the level of filtrations. However, 
we prove a variant of the result, Theorem 14.2. 1 H formulated using filtered Cech 
complexes with fixed scale parameter instead of filtered Rips complexes with fixed 
scale parameter, which does hold on the level of filtrations. This latter result is new 
even for 1-D filtrations; we will use it in Section l4~5~l to present a result, Theorem 14.5.21 
on the topological inference of the superlevelset filtration of a density function 7 via 
a filtered Cech complex with fixed scale parameter built on i.i.d. samples of V 1 . 
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Given a finite metric space (X, d, f) G C SR , and 5 > 0, let F SR (X, d, f, 5) de- 
note the n- filtration obtained by fixing the last parameter in the (n + l)-filtration 
F SR {X, d, f) to be 5. For u G M", i G Z> , and 5 2 > 6 1 > 0, let 

H?(X, d, f, 5 h 5 2 ) cHioRuO F SR (X, d, f, 5 2 ) 

be the image of the map Hi(j), where 

j : R u o F SR (X, d, f, 5,) i? M o d, /, 5 2 ) 

is the inclusion. 

[TT| Theorem 5.1] was stated for geodesic metrics on Riemannian manifolds, but 
the analogous result holds for LP metrics on W 71 , 1 < p < oo. Similarly, we state our 
generalization for geodesic metrics on Riemannian manifolds, but an analogue holds 
for L p metrics as well. 

Theorem 4.2.10. Let M be a Riemannian manifold and let 7 : M — )■ MJ 1 be a c- 

Lipschitz function (w.r.t d hl ). For u G MJ 1 , let T be an e-sample of ^ u (w.r.t. d M ). 
Let 7 : T ->■ R n be such that - 7 (/)| < C for all I G T. If e < ^ then for any 
i G Z> and5e [2e, 

d/(^(T, d M , 7, 5, 25), ^ o R u o F 5 (M, 7)) < 2c5 + C. 

Proof. As noted above, the proof of [12, Theorem 3.7] adapts directly. The one 
difference is that [121 Theorem 3.7] is formulated in terms of "open" variants of the 
sublevelset filtration and the sublevelset-Rips filtration, analogous to the open variant 
of the sublevelset-offset filtration presented in Section 12.7.41 Thus to adapt the proof 
of [121 Theorem 3.7] to our setting we need to appeal to example 13.2.31 

We note also that using our module-theoretic definition of interleavings, the proof 
of [T2"} Theorem 3.7] may be written down in terms of module homomorphisms rather 
than in terms of maps of vector spaces, as in [12] . This makes the proof somewhat 
less cumbersome. □ 
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Now we now formulate our variant of Theorem 14.2.101 for Cech nitrations. As for 
Theorem 14.2.101 we state the result for geodesic metrics on Riemannian manifolds, 
but an analogue holds for the LP metric as well. 

For (X,Y,d,f) E C SCe and 5 > 0, let F sde (X, Y, d, f, 5) denote the n-filtration 
obtained by fixing the last parameter in the (n + l)-filtration F (X, Y, d, /) to be 
5. 

Theorem 4.2.11. Let M be a compact Riemannian manifold and 7 : M — > K n be a 

c- Lip schitz function (w.r.t d M ). For u G W 1 , let T be an e-sample of ^ u (w.r.t. d M ). 
Let 7 : T -»> W 1 be such that \ - -y(Z) | < C for all I e T. Then for any 5 > e, 

d WI (R u o F sde (T, M, d M } 7, 8), R u o F S (M, 7)) < cS + C. 

Proof. The proof is similar to the proof of Theorem 14.2.31 We leave the details to the 
reader. □ 

4.3 Bounds on the Probability that an I.I.D. Sam- 
ple of a Manifold is an e-sample of a Super- 
levelset 

Overview 

[Tl| Section 4.2] presents a bound on the probability that an i.i.d. sample of a 
Riemannian manifold M with density 7 is an e-sample (w.r.t d M ) of a superlevelset 
of 7. 

Here we recall that bound and apply it to obtain a bound on the probability that 
an i.i.d. sample of a submanifold M of M m with density 7 is an e-sample (w.r.t d p ) 
of a superlevelset of 7. 

In the next section, we will use these bounds to prove our main inference results. 
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Definitions and Results 



Let M be a Riemannian manifold. Let A be a subset of M. For reR, let 



V(A,r) = mfU(B d M(x,r)). 



Let J\f(A,r) G N U {00} be the r -covering number of A — that is, Af(A,r) is the 
minimum number of closed <i-balls of the same radius r needed to cover A (the balls 
do not need to be centered in A). 

The following lemma is jTH Lemma 4.3]. 

Lemma 4.3.1. Let M be a Riemannian manifold, 7 : M — > R be a c-Lipschitz 
probability density function (w.r.t d M ), and d = max(l,c). LetT z be an i.i.d. sample 
ofV 1 of size z. Then for any e > and a > ^, T z is an ^-sample of — 7_ Q (w.r.t. 
to d M ) with probability at least 1 - N{-j- a , ^e'^-fM-i-a, ^ . 

Remark 4.3.2. As noted in [IT] . Lemma 14.3. II tells us that if Af(— 7- a , ^7) is positive 
and V(— 7-q, ^7) is finite for every e > then the Hausdorff distance between T z D 
— 7_ a and — 7_ Q converges in probability to as z — )■ 00. 

Remark 4.3.3. It is observed in [11] that J\T{— 7- a , e) is positive and V(— 7_ a ,e) 
is finite for any a > and e > 0, provided M is compact or, more generally, the 
sectional curvature of M is bounded above and below. 

Now we extend Lemma 14.3.11 to the case of L p metrics on submanifolds of M. m . 
For 1 < p < 00, let 



Lemma 4.3.4. Let M be a submanifold ofM. m , 7 : M — > R be a c-Lipschitz probability 
density function (w.r.t d p ), let d = max(l, c), and let T z be an i.i.d. sample ofV 1 of 
size z. Then for any e > and a > ^ , T z is an 4 -sample of — 7_ a (w.r.t. d p ) with 




v/m ifpe[l,2), 
1 if p G [2, 00]. 



probability at least 1 — J\f(— 7. 



«' 2c'K(p) 



e -z(a-f)V(- 7 . 



' 2c'K(p) 
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Proof. For any y 1,1/2 G M, 

d p (yi, 2/2) < K{p) d 2 { Vl , y 2 ) < K(p) d M { Vl , y 2 ). (4.1) 

Thus a c-Lipschitz function w.r.t. d p is a cif (p)-Lipschitz density function (w.r.t d M ). 

Let c" = max(l, cK(p)). Lemma 14.3.11 tells us that for 7 : M — > K a cK(p)- 
Lipschitz probability density function (w.r.t d ), T z an i.i.d. sample of P 7 of size z, 
e' > 0, and a > cKi f) e , T z is an py-sample of — 7_ a (w.r.t. d M ) with probability at 
least 1 - A/"(-7-«, ^ 7 ) e -^ a - £ ^) v (-^— 2^). 

Now substituting e' = Jx^) m ^ ne a bove statement, we obtain for any e > 
0, a > ^r, T z is an e /^^ -sample of M (w.r.t. to c? ) with probability at least 
1 - A/"(-7_ a , 2^y)e^ (Q_ ^ )V( ^ Q '^w ) . Thus by (jO)l . with at least the same 
probability T 2 is an ^--sample of M (w.r.t. c? p ), as we wanted to show. □ 

Note that remark 14.3.21 adapts immediately to the setting of Lemma 14.3.41 

4.4 Inference of the Superlevelset-offset Bifiltra- 
tion of a Density Function via Discrete Bifil- 
trations Built on I.I.D. Samples 

We're now ready to present our results on the inference of the superlevelset-offset 
filtration of a density function 7 on a Riemannian manifold from an i.i.d. sample of 
Vry. These are the culmination of the theory we have developed so far in this thesis. 

We first present results for inference using superlevelset-Cech nitrations. The- 
orem 14.4.11 shows that with high probability, superlevelset-Cech nitrations built on 
sufficiently large i.i.d. samples of V 1 (and filtered by the superlevelsets of a well 
behaved density estimator) give good approximations to the superlevelset-offset fil- 
tration of 7, with respect to dwi- Theorem 14.4.21 an asymptotic form of this result, 
then tells us that such superlevelset-Cech nitrations are consistent estimators (w.r.t. 
dwi) of the superlevelset-offset filtration of 7. 
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After presenting our results for superlevelset-Cech filtraitons, we present ana- 
logues of each these results for superlevelset-Rips nitrations. The main result is The- 
orem 14. 4T4"l which shows that in the large sample limit, the superlevelset-Rips filtration 
built on an i.i.d. sample of "P 7 (and filtered by the superlevelsets of a well behaved 
density estimator) and the superlevelset-offset filtration of 7 are weakly (Jn+i, id n +i)- 
interleaved. Since as observed in Example 13.2.44 superlevelset-Rips and superlevelset- 
Cech filtration functors built on the same input data are always strongly (J~ n +i, id n+ i)- 
interleaved, Theorem 14.4.41 says that in the large sample limit, the superlevelset-Rips 
and superlevelset-Offset nitrations satisfy in the weak sense the same interleaving 
relationship that superlevelset-Rips and superlevelset-Cech nitrations always satisfy 
in the strong sense. 

4.4.1 Inference Results for Superlevelset-Cech Bifiltrations 

Theorem 4.4.1 (Inference of superlevelset-offset nitrations via superlevelset-Cech 
nitrations (finite sample case)). Let M be a Riemannian manifold, 7 : M — > R be 
a density function, T z be a finite i.i.d. random sample of P 7 of size z, and E z be a 
(q, C) -density estimator of another density function 7 : M — > R for some C > and 
qe [0,1]. 

(%) If M is an embedded submanifold of R m , 7 is c-Lipschitz w.r.t. d p for some 
c> 0, and d = max(l, c) then for any e > and a > 

dwi(R(-«,oo)°F sGe (T z , R m , d p , -E Z (T Z )), J R ( _ a , oo) oF 50 (M, R m , dP, -7)) < e+C 

with probability at least 1 - Af(-j_ a , ^j^)e~ z{a ~ ce/c ' )V{ ~ 1 - a '^p> ) - q. 

(ii) If ^ is c-Lipschitz w.r.t. d M for some c > 0, and d = max(l,c) ; then for any 
e > and a > Sf, 

dwi(R{-a, P (M))oF sGe (T z , M, d M , -E Z (T Z )), R^ aAM)) oF so {M, d M , -7)) < e+C 
with probability at least 1 - jV(-7_ a , _i 7 ) e -^ a - ce / c ') v (-7— _ g . 
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Proof. To prove (i), first note that in the event that 

{sup \j(l) - EJT z )(l)\ <C}n {T z is an --sample of M (w.r.t. d p )}, 

Theorem I4.2.3( i) applies to give that 

dwi(R(- a ,oo) o F sde (T z , R, d p , -E t (T z )), # ( _ a)00) o F so (M, R m , d p , -7)) < e + C. 

By Lemma T4.3.44 the definition of a (q, C)-density estimator, and the union bound, 
the probability of this event is at least l-N{--f- a , ^) e ~< a -^/ c 'M-f- a ,±r) _ q This 
gives (i). 

The proof of (ii) is the same as that of (i), using Lemma 14.3.11 in place of 
Lemma [4.3.41 and Theorem I4.2.3( ii) in place of Theorem I4.2.3( i). □ 

Theorem 4.4.2 (Consistent estimation of superlevelset-offset filtration via super- 
levelset-Cech nitrations). Let M be Riemannian manifold with sectional curvature 
bounded above and below, let 7 : M — > R be a density function, and let T z be an i.i.d. 
sample ofV^ of size z. 

(i) If M is a submanifold ofM. m , 7 is c-Lipchitz (w.r.t. d p ) for some c > and E 
is a density estimator on M such that (E, 7) satisfies Al then 



(ii) If M = M. m , 7 is c-Lipchitz (w.r.t. d p ) for some c > and E is a density 
estimator on M m such that (E, 7) satisfies A2 for some kernel K then 



d W i(R {0 ,oo) o F sde (T z , R m , dF, —E Z (T Z )), R (0tOo) o F so (R m , d p , -7 * K)) A 0. 



(Hi) If 7 : M — > R is c-Lipschitz (w.r.t. d M ) for some c > and E is a density 
estimator such that (E, 7) satisfies Al then 



d WI (R {0A M))°F sde (T z ,M,d M ,-E z (T z )),R i0AM)) oF™ A 0. 



dwi(R(o,oo) F 



>SCe 



(T 2 ,M m ,^,-E 2 (T 2 )), J R ( o, oo) oF 



so 



(M,R m ,cF,- 7 )) AO. 
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Proof. To prove (i), we need to show that for any e > and q G (0, 1], there exists 
some Z\ G N such that for all z > Z\, 

dwi(R(o,oo) o F s0e (T z ,R m ,dP,-E z (T z )),R {0tOo) o F 5 °(M, R m , ^, - 7 )) < e 

with probability at least 1 — q. Since (E,j) satisfies Al, Lemma r4.1.1( i) tells us that 
for any q G (0, 1] and e > 0, there exists z G N such that for all z > z , E z is a 
(|, |)-density estimator of 7 . 

Choose e' G (0, |). Then § > tt- Since M has bounded absolute sectional 
curvature, it follows from Remark 14.3.31 that there exists some Z\ G N with Z\ > zq 
such that for all z > z t , AT(- 7 _ f , |L) e -*(i- ce '/ c ')v(-7-|,^) < 1 

Invoking Theorem 14.4.1( 1) with a = |, for all 2 > 

d w/ (i^_ fi0o) oF see (^ < | 

with probability at least 1 — g. 

Then by Lemma |3.3.6[ for all z > zi, 

dwi{R(0 >O o) ° F S(5e (T„ M m , tP, -£7,(T Z )), i2 (0)Oo) o F 5 °(M, R"\ <F, -7)) < | + \ = e 

with probability at least 1 — q. This completes the proof of (i). 

The proof of (ii) is essentially the same as that of statement (i), using LemmaHXTJ^ii) 
in place of Lemma [4.1.1( i). The proof of (iii) is also essentially the same as that of 
statement (i), using Theorem 14.4. 1 If ii) in place of Theorem 14.4. l( i). □ 

Corollary 4.4.3. For any i G Z>o, 

(i) Under the same assumptions as in the statement of Theorem \4-4-^ i), 



djiHioR^oF 3 ^, R m , tF, —E Z (T Z )), ^oi? (0jOo) oF so (M, R"\ d?, -7)) A 0. 
(ii) Under the same assumptions as in the statement of Theorem \4-4- ^( H ), 



d 7 (^oi?(o,oo)oF SCe (T 2 , R m , d p , —E Z (T Z )), #,oi? (0iOo) oF so (M"\ d p , ->y*K)) A 0. 
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(Hi) Under the same assumptions as in the statement of Theorem J^.J^.2(iii), 

d^oR^^oF^Tz, M, d M , —E Z (T Z )), H % oR MM)) oF so {M, R m , d M , -7)) A 0. 

Proof. This follows immediately from Theorems 14.4.21 and 13.3.91 □ 

4.4.2 Inference Results for Superlevelset-Rips Bifiltrations 

Convergence in Probability up to Interleavings 

To describe our asymptotic results for superlevelset-Rips nitrations it will be conve- 
nient for us to introduce some terminology. Let Ji, J 2 : K n — > M. n be increasing maps 
and let u e W 1 . We say that a sequence {X z } ze ^ of random w-filtrations converges in 
probability to a random u-filtration X up to weak (J 1; J 2 )-interleaving, and we write 

if for all e > 

lim P(X Z and X are weakly (J t o J 1; J t o J 2 )-interleaved) = 1. 

z— ¥00 

Note that X n P,ld "' ld "> X if and only if dwi{X n , X) 0. 

Similarly, we say that a sequence {M z } z£N of random _B„-persistence modules 
converges in probability to a random £? n -persistence module M up to weak (Ji, J 2 )- 
interleaving, and we write 

M z M 

if for all e > 

lim P(M Z and M are weakly (J e o J 1} J e o J 2 )-interleaved) = 1. 

2— »oo 

M„ P ' idn ' idn ) M if and only if dj{M z , M) A 0. 
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Theorem 4.4.4 (Asymptotics of estimation of superlevelset-offset filtration via su- 
perlevelset-Rips nitrations). 



(i) Under the same assumptions as in Theorem U^JJWi), 



R {%oo) oF SR {T z ,dP-E z {T z )) P ' J ' 1+1 ' idn+1 ) #(0,00) oF 5 °(M,M"\cF,- 7 ). 



(ii) Under the same assumptions as in Theorem \4-4- H .™ ) 



R (0>oo) o F SR (T Z , d\ -E Z {T Z )) "'^ +1 '' dn+1 ) R (0>oo) o F so (R m , #, -7 * K). 



(Hi) Under the same assumptions as in Theorem \4-4- Wiii ), 



R M M)) o F SR (T Z , d M , -E Z (T Z )) P '^ +1 ' id " +1 ) R {0AM)) o F so (M, d M , -j). 

Proof. To prove (i), we need to show that under the same assumptions as in The- 
orem I4.4.2( i). for any e > and q G (0,1], there exists some z' G N such that 
for all z > z', i2 (0)Oo) o F SR {T z ,dP,-E z {T z )) and R {0>oo) o F so (M, R m , dP, -7) are 
(J e o J7n+i, J e id n+ i)-interleaved with probability at least 1 — q. 

By ExampleE^ai F SR (T, d p , 7) and F s0e (T, R m , dP, 7) are (J n+1 , id n+ i)-interleaved. 

Theorem 14.4.21 tells us that there exists some z" > such that for all z > z", 
R(p,oo) ° F sde (T z , R m , dP, -E Z (T Z )) and R^) o F so (M, R m , d p , -7) are e-interleaved 
with probability at least 1 — q. 

By Lemma 13.3.71 then, for all z > z", R(o >00 ) o F SR (T z ,d p , —E Z {T Z )) and R(o )00 ) 
F so (M, R m , d p , — 7) are (J e o jTn+i, J £ ) -interleaved with probability at least 1 — q. 
Thus taking z' = z" gives the result. 

The proofs of (ii) and (iii) are essentially the same as that of (i), using Theo- 
rems H3^ii) and I4.4.27 iii) in place of Theorem 14.4. 2f i). □ 

Corollary 4.4.5. For any i G Z> , 

(%) Under the same assumptions as in the statement of Theorem \4-4- ^ )> 



^ o #(0,00) o F SR (T Z , dP, -E Z (T Z )) P '^ +1 ' idn+1 > H t o R {0>oo) o F so (M, R m , dP, -7). 
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(ii) Under the same assumptions as in the statement of Theorem ^.^.2[ ii), 



HioR (0iOo) oF SR (T z ,(F,-E z (T z )) P ^ +1 ' id " +1 ) ^ Oj R (0iOo) oF so (R m ,^,- 7 ^). 



(Hi) Under the same assumptions as in the statement of Theorem \4-4- Wiii ) 



Hi o R (0AM)) o F SR (T Z , d M , -E Z (T Z )) P '^" +1 ' idn+1 ) R i0AM)) o F 5 °(M, d M \ -7). 



Proof. This follows immediately from Theorem 14.4.41 and Theorem 13.3.91 □ 

4.5 Inference of the Super levelset Filtration of a 
Probability Density via Filtered Cech Com- 
plexes 

In this section we apply Theorem 14 . 2 . 1 1 1 1 o obtain a version of the inference theorem 
of [TTJ Theorem 5.1] which holds on the level of nitrations rather than only on the 
level of persistent homology modules. Unlike [HI Theorem 5.1], this result holds for 
only estimators defined using Cech nitrations, not for estimators defined using Rips 
nitrations; the result of (TTj Theorem 5.1] for estimators defined using Rips nitrations 
does not in general lift to the level of nitrations. 

We state the result for geodesic metrics on Riemannian manifolds; an analogous 
result holds for LP- metrics on submanifolds of W 71 . 

Theorem 4.5.1 (Inference of superlevelset filtrations). Let M be a Riemannian man- 
ifold, let 7 : M — )■ K be a c-Lipschitz density function w.r.t. d M for some c > 0. Let 
T z be a finite i.i.d. random sample ofT J 1 of size z. Let E z be a (q,C) -density estima- 
tor of another density function 7 : W 71 — > K for some C > and q G [0, 1]. Then for 
any e > 0, 5 > e, and a > ce. 

dwi(R(-a,p(M)) o F sC ' e (T z , M, d M , -E Z (T Z ), 5), R ( - aAM )) ° F S (M, -7)) < cd + C 
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with probability at least 1 — J\f(— 7_ a , |)e z(a C£ ^ v ^ 7 -<*>2) — g. 

Proof. The proof is the essentially same as the proof of Theorem 14.4. H using Theo- 
rem 14.2.111 in place of Theorem 14.2.31 □ 

Theorem 4.5.2 (convergence in probability of superlevelset-Cech nitrations with 
fixed scale parameter to superlevelset filtration of density function). Let M be a 
Riemannian manifold with bounded absolute sectional curvature, let 7 : M — > R be a 
c-Lipschitz density function (w.r.t. d M ) for some c > 0, and let T z be an i.i.d. sample 
ofV 1 of size z. 

(i) If M = M m and E is a density estimator on W 71 such that (E,^) satisfies Al 
then there exists a sequence of positive real numbers {S z } z( z^ such that 

dwi(R(o,oo) oF sGe (T z ,R m ,d p ,-E z (T z ),6 z ),R {0>OD) oF 5 (M m ,- 7 )) A 0. 

(ii) If M = M m and E is a density estimator on W 71 such that (E, 7) satisfies A2 
for some kernel K then there exists a sequence of positive real numbers {<5 2 } 2 eN 
such that 

dwi{R{o,oo) o F sde (T z , R m , d\ -E Z (T Z ), 5 Z ), R (0>Qo) o F s (R m , -7 * K)) A 0. 

(Hi) If E is a density estimator such that (E,j) satisfies Al then there exists a 
sequence of positive real numbers {d~ z } z< =n such that 

dwi(R(o, P (M)) ° F sde (T z , M, d M , -E Z (T Z ), 6 g ), R M M)) ° F S (M } -7)) A 0. 

Proof. We'll prove (iii). The proofs of (i) and (ii) are essentially the same, using 
Lemma 14. 1.1 I f ii) in place of Lemma I4.1.1( i) and an analogue of Theorem 14.5.11 for L p 
metrics on R m . 

To prove (i), we need to find a sequence of positive real numbers {<5 2 } 2e N such 
that for any e > and p G (0, 1], there exists some Z\ G N such that for all z > Z\, 

dwi(R(o, P (M)) o F sde (T zl M, d M , -E Z (T Z ), S z ),R (0A m)) ° F S (M, -7)) < e 
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with probability at least 1 — p. To construct our sequence {5 z } zg Nj we first choose a 
monotonically decreasing sequence £ : N — > (0, 1] such that lim^oo £(z) = 0. Since 
(E, 7) satisfies Al, Lemma l4.1.1f i) tells us that there exists a strictly increasing 
sequence of natural numbers j : N — > N such that for any y G N and any z > j(y), 
E z is a (^y-, ^-)-density estimator of 7. 

Choose a second monotonically decreasing sequence £' : N — > (0, 1] such that for 
each z G N, < ^). Let c' = max(l,c). For each z G N, > Since 

M has bounded absolute sectional curvature, it follows from Remark 14.3.31 that there 
exists a strictly increasing sequence of natural numbers I : N — > N with l(z) > j(z) 
for all z G N such that for y G N and all z > l(y), 

AT(-7_cM,^r)e - <— . 

Invoking Theorem 14.5.11 with the variables (a, e, 5) in the statement of that The- 
orem set equal to (^^, ^7^, ^r^), we obtain the following result, which we state as a 
lemma. 

Lemma 4.5.3. For all y G N, £'{y), l(y) defined as above, and all z > l(y) 
dwi(R { _m A M)) ° F sCe (T z , M,d M , -E z (T z ),e,\y)), fl ( _sM, p( M)) ° fS ( m . 

<('(y) + ^<m 

with probability at least 1 — £(?/). 
Now define Z" 1 : N ->■ N by 



V\z) 



max{z' G N|Z(z') < 2} if 1(1) < z, 
1 otherwise. 



Define {d z } zE f$ by taking 5 Z = 1 (z)) for all z6l 

Now choose e > and p G (0, 1]. There's some y G N such that < min(e,p). 
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We claim that for all z > l(y), 

dwi{R { _m p{M)) oF sGe (T z ,MJ M ,-E z (T z ),6 z ),R ( _^ AM)) oF so (M, -7)) < ^ 

with probability at least 1 — £(y). 

Then, given the claim, we set Z\ = l(y). By Lemma [3.3.61 for all z > Z\, 

dwi(Rt.0AM)) o F sC '(T„ M, d M , -E,(T,),6,), B(o, p (m)) o F S (M, -7)) 

<m + m =m < t 

with probability at least 1 — £(y) > 1 — p, which completes the proof of (i). 
To prove the claim, we plug in y = l^ 1 (z) into Lemma [4.5.31 to obtain that 

dwi(R { p(M)) o F s6 '(T z , M, d M , -E z {T z ),£(r\z))), R { s!!zllM p(M)) o F S (M, - 

< f(r . w ; + 'fqwi < «qwi 

with probability at least 1 — 

Since / is strictly increasing, when z > /(?/), l^ 1 {z) > = y. Since the 

sequences £ and £' are monotonically decreasing, we then have that < £(y) 

and <^(2/)- Thus, 

<iM 2 

- 2 

with probability at least 1 — £(y). Applying Lemma [3.3.51 and using the definition of 
5 Z , we obtain that 



dwi(R ( _<M A M)) ° F SC '(T Z , M,d M , -E Z {T Z ),S Z ),R^ AU)) o F so (M, -7)) 

< C(v) + ^ < ^ 
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with probability at least 1 —£(?/). This proves the claim and completes the proof of 



4.6 Future Work on Persistence-Based Topological 
Inference 

We close this chapter with a discussion of directions for future work on the statistical 
aspects of persistence-based topological inference. 

Almost Sure Convergence Results 

Our asymptotic results are formulated in terms of convergence in probability and a 
variant of it which we have called convergence in probability up to interleaving s. It 
would be interesting to know whether our results can be strengthened to almost sure 
convergence results. 

Bootstrap Confidence Balls for Persistence 

The natural next step in the development of the theory of persistence-based topologi- 
cal inference is to develop a theory of confidence regions for the consistent estimators 
studied in this chapter. This problem has hitherto not been addressed in any pub- 
lished work, for even for 1-D persistent homology. 

In the setting of Theorem 14.4.11 it seems that it should be possible to employ 
the bootstrap methodology to compute approximate confidence balls (in the 
pseudometric space of multidimensional nitrations given by the weak interleaving 
interleaving distance) for a superlevelset-Cech estimator X of the superlevelset-offset 
filtration Y of a probability density function. 

For a e (0, 1), to compute an approximate a-confidence ball for X the simplest 
and most natural approach would be to 

1. Construct bootstrap replicates X%, Xi of the filtration X by taking superlevelset- 
Cech nitrations of bootstrap replicates of the point cloud data on which X was 
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built. 

2. Compute the smallest value of y such that a fraction a of the bootstrap replicates 
Xx, ...Xi lie within weak interleaving distance y of X. 

We expect that, under mild hypotheses, the ball of radius y centered at X would then 
be an approximate a-confidence ball for X, and that an analogous result would hold 
on the level of persistent homology modules. 

A well developed theory along these lines of bootstrap confidence balls for persis- 
tent homology estimates, together with an efficient computational pipeline for com- 
puting the radii of confidence balls, could add in a significant way to power of persis- 
tent homology as a tool for understanding the qualitative structure of random data. 

Testing Persistence-Based Hypotheses 

Another important problem in topological inference is to develop a theory of hypoth- 
esis testing for hypotheses formulated using the language of persistence. Given the 
close relationship between hypothesis testing and computation of confidence regions, 
this problem is much in the same spirit as the problem of computing approximate 
confidence balls for the estimators of this thesis. 

The problem of testing hypotheses formulated using persistence has hitherto not 
yet been treated by the topological data analysis community, yet seems to be of basic 
importance, particularly in the case of th persistent homology. To explain, on the 
one hand th persistent homology offers an elegant and user-friendly language for 
describing the multi-modality of functions in a way that is sensitive to the "size" of 
the modes. On the other hand, mode detection of densities and regression functions 
is an old and well studied problem in statistics, and one of fundamental interest — see, 
for example [40J and (HI Chapter 20] . Classical statistics has developed approaches to 
testing for mult i- modality based on smoothing ideas [ID] , but as the classical language 
for describing multi-modality is impoverished relative to that offered by persistent 
homology, there seems little doubt that persistent homology would have something 
significant to offer to the problem of testing multi-modality hypotheses. 
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Inference of Persistent Homology at the Cycle Level 

It is possible to geometrically represent the persistent homology module of a filtration, 
at the chain level, by choosing a presentation for the persistent homology module and 
then choosing representative cycles and boundaries for the generators and relations in 
the presentation. There are of course many different ways to make such choices, most 
of which are not geometrically interesting. However, for nitrations whose topological 
spaces are equipped with a sufficient amount of geometric structure we can in principle 
define notions of optimality of chain-level representations of persistence modules in 
such a way that the optimal choice will be geometrically meaningful. Then, using 
such notions, we can pose persistence-based inference problems directly on the chain 
level. The results on clustering of [TT] are loosely in this spirit, for th persistent 
homology. It would be very interesting to understand if and how those results adapt 
to higher persistent homology and to the multidimensional setting. 

The problem of pursuing inference at the chain level is an extremely important 
one, at least to the extent that we are interested in the use of persistent homology 
for topological inference: A descriptor of a probability distribution defined only using 
the isomorphism class of a persistent homology module gives us only indirect infor- 
mation about the qualitative structure of the probability distribution; ultimately we 
want to understand how the algebraic features of the persistent homology descriptor 
correspond to specific geometric features of the probability distribution. For that we 
need to consider representatives of those algebraic features on the chain level. 



Chapter 5 
Conclusion 



In this thesis, we have introduced and studied (Ji, ^-interleavings on multidimen- 
sional persistence modules and strong and weak (Ji, J2)-interleavings on multidimen- 
sional nitrations. We have undertaken a careful study of such interleavings and of the 
interleaving distances dj, dsi, and dy/i defined in terms of them. We have applied 
interleavings and interleaving distances to adapt the persistence-based topological in- 
ference result [TTJ Theorem 5.1] to the multidimensional setting and directly to the 
level of nitrations. The culmination of our efforts is a pair of results, Theorems 14.4.21 
and 14.4.41 describing the topological asymptotics of random sublevelset-Cech and 
sublevelset-Rips bifiltrations (filtered by the superlevelsets of a density estimator) 
in the large sample limit. These theorems put on firm mathematical footing the 
idea, first put forth in [TJ, that such nitrations should encode topological information 
about the probability density function of a probability distribution generating the 
point cloud data on which the bifiltrations are built. 
One of the central themes of this thesis has been that 

1. To formulate results in the theory of persistence-based topological inference, 
one first needs to select notions of similarly between nitrations and persistence 
modules. 

2. Much of the substantive theoretical work to be done in the study of topological 
persistence lies in understanding what the right choices of those notions of 
similarity are. 
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Indeed, we have spent the bulk of our effort in this thesis developing the theory 
of interleavings and interleaving distances. The theory we have developed makes a 
strong, if still incomplete, case that interleavings are natural tools with which to 
frame the theory of topological inference of multifiltrations. 

Our discussions of directions for future work in Sections |2.12[ I3.3.7[ and 14.61 make 
it clear that there is much more work to be done on the theory of interleavings of 
multidimensional persistence modules, on the theory of weak interleavings of multi- 
dimensional nitrations, and on the statistical aspects of persistence-based topological 
inference. I believe that progress in these directions, seen as a part of the larger pro- 
gram of fleshing out the statistical foundations of topological data analysis, stands a 
good chance of contributing something fundamental to the way scientists, engineers, 
and statisticians think about and perform the qualitative analysis data. 



Appendix A 



Minimal Presentations of 
Multidimensional Persistence 
Modules 

A.l The Coherence of B n 

A. 1.1 Coherence: Basic Definitions and Results 

k[x±, ...,x n ] is well known to be a Noetherian ring. Finitely generated modules over 
Noetherian rings have some very nice algebraic properties. Here we define a standard 
weakening of the Noetherian property called coherence. Analogues of many of the 
same nice algebraic properties that hold for finitely generated modules over Noethe- 
rian rings hold for finitely presented modules over coherent rings. In particular, we 
have Corollary IA.1.4| which we will use in Appendix IA.2I to prove Theorem 12.1.41 

Definition. For R a ring, we say an R-module M is coherent if M is finitely 
generated and every finitely generated submodule of M is finitely presented. We say 
a ring R is coherent if it is a coherent module over itself. 

Coherent commutative rings and coherent modules are well studied; the following 
results are standard. The reader may refer to [2B] for the proofs. 
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Proposition A. 1.1. If R is a Noetherian ring then R is coherent. 

Theorem A. 1.2. If R is a coherent ring then every finitely presented R-module is 
coherent. 

Theorem A. 1.3. If f : M — )■ iV is a morphism between coherent R-modules M and 
N then ker(/) ; im(/), and coker(/) are coherent R-modules. 

Combining these last two theorems immediately gives 

Corollary A. 1.4. If R is a coherent ring and f : M — )■ iV is a morphism between 
fintely presented R-modules M and N, then kei(f), im(/) ; and coker(/) are finitely 
presented. 

A. 1.2 The Ring B n is Coherent 
Theorem A. 1.5. For any n 6 N, B n is coherent. 
Proof. The key to the proof is the following theorem: 

Theorem A. 1.6 ([28, Theorem 2.3.3]). Let {R a }a£S be a directed system of rings 
and let R = lim^ R a . Suppose that for a < (3, Rp is a flat R a module and that R a is 
coherent for every a. Then R is a coherent ring. 

First, recall that M is a vector space over Q. We'll say that a%, ...,ai G M>o are 
rationally independent if they are linearly independent as vectors in R over the field 
Q. 

We next extend this definition to vectors in R> : We say a finite set V C M> is 
rationally independent if 

1. V is the union of sets V\,...,V n , where each element of Vi has a non-zero i th 
coordinate and all other coordinates are equal to zero. 

2. For any i, if a\, a% are the non-zero coordinates of the elements of Vi (listed 
with multiplicity), then ai, ai are rationally independent in the sense defined 
above. 
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We define an n-grid to be a monoid generated by some rationally independent set 
V C K> . Denote the n-grid generated by the rationally independent set V as r(V). 
r(V) is a submonoid of IR> . 

Lemma A. 1.7. If V is a rationally independent set, then the n-grid generated by V 
is isomorphic to Z>q. 

Proof. The proof is straightforward; we omit it. □ 

As noted in Section |2.1.2[ for any m G N, fc[Z> ] = k[xi, x m ]. As the latter 
ring is Noetherian, it is coherent by proposition IA. 1 . II Thus if G is an n-grid, k[G] 
is coherent. 

Lemma A. 1.8. For any finite set A C M>o, there's a rationally independent set 
B C M>o such that A lies in the monoid generated by B. 

Proof. We proceed by induction on the number of elements I in the set A. The base 
case is trivial. Now assume the result holds for sets of order I — 1. Write A = 
{ax, ...,ai}. By the induction hypothesis there exists a finite rationally independent 
set A' = {a' 1; a' m } such that {a\, ai-\} lies in T(A'). If A' U a\ is rationally 
independent, take B = A' U a%. Otherwise ai = qia[ + ... + g m _ 1 a^ n „ 1 for some 
qi, ...,qi-i G Q; we may take B = {?i a ij •••) 9t-i a m-i}> where q\ = 1/bi for some 
hi G N such that g» = a /hi for some a G Z>o- □ 

Lemma A. 1.9. The set ofn-grids forms a directed system under inclusion with direct 
limit M| . 

Proof. To show that the set of n-grids forms a directed system, we need that given 
two n-grids Q\ and G 2 , there's an n-grid G 3 such that G\ C G 3 and G 2 C G 3 . This 
follows readily from Lemma IA.1.8( we leave the details to the reader. Any element of 
M> lies in an n-grid, so M> must be the colimit of the directed system. □ 

For a monoid A and a submonoid A' <Z A, we have k[A'] C k[A\. This implies the 
following: 
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Lemma A. 1.10. The set of rings {k[G]\G is ann-grid} has the structure of a directed 
system induced by the directed system structure on the set of n-grids, and B n is the 
direct limit of this directed system. 

Proposition A. 1.11. Given two positive n-grids G',G with G' C G, k[G] is a free 
k[G'] module. 

Proof. We begin by establishing a couple of lemmas. 

Lemma A. 1.12. For any rationally independent set V and n-grid A containing 
r(V'), there is a rationally independent set V such that A = T(V) and such that for 
each a G V , V contains an element of the form a/b for some b G N. 

We call V an extension of V. 

Proof. The proof of Lemma IA. 1.121 is similar to the proof of Lemma IA.1.8t we omit 
it. □ 

Let S denote the set of maximal sets of the form g + G' = {g + g'\g' G G'} for 
some g G G. 

Lemma A. 1.13. The sets S form a partition of G. 

Proof. It's enough to show that if g 1 + G', g 2 + G' eS and g 1 + G' H g 2 + G' ^ 0, then 
9i + G' = g 2 + G'. 

Let V be a rationally independent set with r(V) = G', and let V be an extension 
of V with T{V) = G. Write V = ...,!;/} and V = {v x /b x , u,/6 { , v l+1 , ...,v m } 
for some bi, bi G N. 

Assume there exist g[, g' 2 G G' such that gi + g[ = g 2 + g' 2 - We'll show that there 
then exists an element g% G G such that g\,g 2 G g^ + G. By the maximality of g\ + G' 
and g 2 + G', this implies g x + G' = g 2 + G', as needed. We write 

91 = yivi/h H h yivi/bi + y l+1 v l+1 H h y m Vm, 

92 = z 1 v 1 /b l H h zivjbi + zi +1 v l+1 H h z m v m , 

g'i = y[v! + ■ ■ ■ + yfa, 

9 2 — z[vi H h z[vi. 
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for some yi, y m , zi, z m , y[, y' t , z[, z[ G Z. By the rational independence of 
V and the fact that g\ + g[ = g 2 + g' 2 , we have that yi = Zi for I + 1 < % < m. 



g 3 = min(?/i, zi)vi/h H h vnxs\{y h z{)vi/bi + yj+i^+i H h y m v m , 

91 = (Vi - mind/x, *i) W&i H h (yi - min(^, zi))v t /b h 

92 = ( z i ~ min (2/i> Zi)M/&i H h - min(yj, zi))vi/bi. 

93 + <7i = #i an d 53 + #2 = 02, so if we can show that g", g 2 ' G G" we are done. 

By the rational independence of V and the fact that gi + g[ = g 2 + g' 2 , ior 1 < i < I 
we have that min(yj, + max(^, z-) = s/j/frj + y[. This implies that max(y^, — 
Hi = (Vi ~ mm (z/i ; Zi))/bi- In particular, the term on the right hand side lies in Z> . 
Thus g'l G G' . The same argument shows g' 2 ' E G'. □ 

Now we are ready to complete the proof of Proposition IA. 1.111 It's easy to see 
that for any s G S, the natural action of G' on s extends to give k[s] the structure of 
a free k[G'] module of rank 1. It follows from Lemma [A. 1 . 1 31 that the sets {/c[s]} se s 
have trivial intersection as /c[G']-submodules of k[G]. We then have that as a k[G'] 
module, k[G] = ® s esk[s], and so in particular k[G] is a free fc[Gr']-module, as we 
wanted to show. □ 

Given Lemma IA. 1.101 and Proposition IA.1.11} Theorem IA.1.61 applies to give that 



A. 2 Minimal Presentations of B^-persistence Mod- 
ules 

This section is devoted to the proof of Theorem 12.1.41 



Define 




□ 



APPENDIX A. MINIMAL PRESENTATIONS 



157 



A.2.1 Free Hulls 

We first observe that some standard results about resolutions and minimal resolutions 
of modules over local rings adapt to _B n -persistence modules. We'll only be interested 
in the specialization of such results to the th modules in a free resolution, and for 
the sake of simplicity we phrase the results only for this special case. However, 
the results discussed here do extend to statements about free resolutions of finitely 
presented _B n -persistence modules. 

Let m denote the ideal of B n generated by the set 

{v G B n \v is homogeneous and gr(v) > 0}. 

Define a free hull of M to be a free cover (Fm, Pm) such that ker(p^) C v&Fm- 

Nakayama's lemma |25j is a key ingredient in the proofs of the results about free 
resolutions over local rings that we would like to adapt to our setting. To adapt these 
proofs, we need an n-graded version of Nakayama's lemma. 

Lemma A.2.1 (Nakayama's Lemma for Persistence Modules). Let M be a finitely 
generated B n -persistence module. If y\, y m G M have images in M/mM that 
generate the quotient, then y%, ...,y m generate M. 

Proof. The usual Proof of Nakayama's lemma [25J carries over with only minor 
changes. □ 

Lemma A. 2. 2. A free cover (Fm,Pm) of a finitely generated B n -persistence module 
M is a free hull iff a basis for Fm maps under pu to a minimal set of generators for 
M. 

Proof. Given the adaptation Lemma [A. 2. II of Nakayama's lemma to our setting, the 
proof of [251 Lemma 19.4] gives the result. □ 

It follows easily from Lemma [A. 2 .21 that a free hull exists for any finitely generated 
-B n -persistence module M. Corollary IA.2.41 below gives a uniqueness result for free 
hulls. 
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Theorem A. 2. 3. If (Fm,Pm) ^ s a f ree hull of a finitely presented B n -persistence 
module M and (F' M , p' M ) is any free cover of M , then Fm includes as a direct summand 
of F' M in such a way that F' M = F M © F M for some free module F M , and ker(p' M ) = 
ker(p M ) © F M C F M © 

Sketch of Proof. The statement of the theorem is the specialization to th modules in 
the free resolutions of M of an adaptation of [251 Theorem 20.2] to our 5 n -persistence 
setting. To modify Eisenbud's proof of [25"1 Theorem 20.2] to obtain a proof of The- 
orem IA.2.31 one needs to invoke the coherence of B n and use Corollary IA. 1.41 to 
show that ker(p^f) is finitely generated. Given this, the strategy of proof adapts in a 
straightforward way. □ 

Corollary A. 2. 4 (Uniqueness of free hulls). If M is a finitely presented B n -persistence 
module, and (Fm,Pm), (F' M ,p' M ) are two free hulls of M, then there is an isomor- 
phism from Fm to F' M which is a lift of the identity map of M. 

Proof. By Theorem IA.2.31 we can identify Fm with a submodule of F' M in such a way 
that F' M = F M ®F M for some free module F M and ker(p' M ) = kex{p)@F' M C F M ®F M . 
Since F' M is a free hull, we must have ker(p' M ) e mF' M) which implies F M = 0. The 
result follows. □ 

Corollary A. 2. 5. If M is a finitely presented B n -persistence module and B,B' are 
two minimal sets of generators for M , then gr(B) = gr(B'). 

Proof. This follows from Corollary IA.2.41 and Lemma IA.2.21 □ 
A.2.2 Proof of Theorem 12X41 

Recall that a minimal presentation (G\R) of a _B n -persistence module M is one such 
that 



1. the quotient (G) — > (G) / (R) maps G to a minimal set of generators for (G) / (R). 

2. R is a minimal set of generators for (R). 
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Let M be a finitely presented _B n -persistence module. Let (G\R) be a minimal 
presentation of M. We need to show that for any other presentation (G'\R') of M, 
gr(G) < gr(G') and gr(R) < gr(R'). 

Let ip : (G)/(R) -»■ M and 0' : (G'}/(R'} -»■ M be isomorphisms, let vr : (G) -»■ 
(G) / (R) and 7r' : (G") — > (G , )/(R r ) be the quotient homomorphisms, let p = -0 o 71", 
and let p' = ip' o n'. Then by Lemma [A. 2. 2[ ((G), p) is a free hull of M, and ((G'), p') 
is a free cover of M. 

By Theorem IA.2.3t (G) includes as a direct summand of (G') . The image of G 
under this inclusion can be extended to a basis for (G'). Recall that if B and B' are 
two bases for a free -B„-persistence module F, then gr(B) = gr(B'). We thus have 
that gr(G) < gr(G'). 

Theorem IA.2.31 also implies that (R 1 ) = (R) © F for some free _B n -persistence 
module F. Let B be a basis for F . Then R U B is a minimal set of generators for 
(i?) © .F. Let i?" denote the image of R' under an isomorphism from (R 1 ) to (R) © F 
and let p : (i?) © F — > (R) denote projection onto the first summand. Since p is 
surjective, p(R") is a set of homogeneous generators for (R). 

Since (G) and M are finitely presented, by Corollary |A. 1.41 ker(p) = (R) is also 
finitely presented. Then by Corollary IA.2.5| gr(R) < gr(p(R")). Since gr(p(R")) < 
gr(R") = gr(R') we have that gr(R) < gr(R'). □ 
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